Punching above its weight: a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions
Objective: This study aimed to compare the performance of DeepSeek-R1 and OpenAI-o1 in addressing complex pancreatic ductal adenocarcinoma (PDAC)-related clinical questions, focusing on accuracy, comprehensiveness, safety, and reasoning quality. Methods: Twenty PDAC-related questions derived from th...
| Main authors: | Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang |
|---|---|
| Document type: | Article (Journal) |
| Language: | English |
| Published: | 2025-8-22 |
| In: | International journal of medical sciences, Year: 2025, Volume: 22, Issue: 15, Pages: 3868-3877 |
| ISSN: | 1449-1907 |
| DOI: | 10.7150/ijms.118887 |
| Online access: | Publisher, free of charge, full text: https://doi.org/10.7150/ijms.118887; Publisher, free of charge, full text: https://www.medsci.org/v22p3868.htm |
| Statement of responsibility: | Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang |
MARC
| LEADER | 00000naa a2200000 c 4500 | ||
|---|---|---|---|
| 001 | 1939897157 | ||
| 003 | DE-627 | ||
| 005 | 20251103105811.0 | ||
| 007 | cr uuu---uuuuu | ||
| 008 | 251103s2025 xx |||||o 00| ||eng c | ||
| 024 | 7 | |a 10.7150/ijms.118887 |2 doi | |
| 035 | |a (DE-627)1939897157 | ||
| 035 | |a (DE-599)KXP1939897157 | ||
| 040 | |a DE-627 |b ger |c DE-627 |e rda | ||
| 041 | |a eng | ||
| 084 | |a 33 |2 sdnb | ||
| 100 | 1 | |a Li, Cheng-Peng |e VerfasserIn |0 (DE-588)1363080296 |0 (DE-627)1922902276 |4 aut | |
| 245 | 1 | 0 | |a Punching above its weight |b a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions |c Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang |
| 264 | 1 | |c 2025-8-22 | |
| 300 | |b Diagramme | ||
| 300 | |a 10 | ||
| 336 | |a Text |b txt |2 rdacontent | ||
| 337 | |a Computermedien |b c |2 rdamedia | ||
| 338 | |a Online-Ressource |b cr |2 rdacarrier | ||
| 500 | |a Gesehen am 03.11.2025 | ||
| 520 | |a Objective: This study aimed to compare the performance of DeepSeek-R1 and OpenAI-o1 in addressing complex pancreatic ductal adenocarcinoma (PDAC)-related clinical questions, focusing on accuracy, comprehensiveness, safety, and reasoning quality. Methods: Twenty PDAC-related questions derived from the up-to-date NCCN guidelines for PDAC were posed to both models. Responses were evaluated for accuracy, comprehensiveness, and safety, and chain-of-thought (CoT) outputs were rated for logical coherence and error handling by blinded clinical experts using 5-point Likert scales. Inter-rater reliability, evaluated scores, and character counts by both models were compared. Results: Both models demonstrated high accuracy (median score: 5 vs. 5, p=0.527) and safety (5 vs. 5, p=0.285). DeepSeek-R1 outperformed OpenAI-o1 in comprehensiveness (median: 5 vs. 4.5, p=0.015) and generated significantly longer responses (median characters: 544 vs. 248, p<0.001). For reasoning quality, DeepSeek-R1 achieved superior scores in logical coherence (median: 5 vs. 4, p<0.001) and error handling (5 vs. 4, p<0.001), with 75% of its responses scoring full points compared to OpenAI-o1's 5%. Conclusion: While both models exhibit high clinical utility, DeepSeek-R1's enhanced reasoning capabilities, open-source nature, and cost-effectiveness position it as a promising tool for complex oncology decision support. Further validation in real-world multimodal clinical scenarios is warranted. | ||
| 700 | 1 | |a Chu, Yuan |d 1994- |e VerfasserIn |0 (DE-588)1373626186 |0 (DE-627)1932986146 |4 aut | |
| 700 | 1 | |a Jia, Wei-Wei |e VerfasserIn |4 aut | |
| 700 | 1 | |a Hakenberg, Priska |d 1990- |e VerfasserIn |0 (DE-588)1196385238 |0 (DE-627)1678166952 |4 aut | |
| 700 | 1 | |a Şandra-Petrescu, Flavius Ionuţ |d 1977- |e VerfasserIn |0 (DE-588)1029219907 |0 (DE-627)732605490 |0 (DE-576)377147753 |4 aut | |
| 700 | 1 | |a Reißfelder, Christoph |d 1975- |e VerfasserIn |0 (DE-588)1025566211 |0 (DE-627)722940297 |0 (DE-576)370516044 |4 aut | |
| 700 | 1 | |a Yang, Cui |d 1984- |e VerfasserIn |0 (DE-588)1136151982 |0 (DE-627)891949968 |0 (DE-576)490363180 |4 aut | |
| 773 | 0 | 8 | |i Enthalten in |t International journal of medical sciences |d Wyoming, NSW : Ivyspring International Publ., 2004 |g 22(2025), 15, Seite 3868-3877 |h Online-Ressource |w (DE-627)390964174 |w (DE-600)2151424-0 |w (DE-576)281254109 |x 1449-1907 |7 nnas |a Punching above its weight a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions |
| 773 | 1 | 8 | |g volume:22 |g year:2025 |g number:15 |g pages:3868-3877 |g extent:10 |a Punching above its weight a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions |
| 856 | 4 | 0 | |u https://doi.org/10.7150/ijms.118887 |x Verlag |x Resolving-System |z kostenfrei |3 Volltext |
| 856 | 4 | 0 | |u https://www.medsci.org/v22p3868.htm |x Verlag |z kostenfrei |3 Volltext |
| 951 | |a AR | ||
| 992 | |a 20251103 | ||
| 993 | |a Article | ||
| 994 | |a 2025 | ||
| 998 | |g 1136151982 |a Yang, Cui |m 1136151982:Yang, Cui |d 60000 |d 61800 |e 60000PY1136151982 |e 61800PY1136151982 |k 0/60000/ |k 1/60000/61800/ |p 7 |y j | ||
| 998 | |g 1025566211 |a Reißfelder, Christoph |m 1025566211:Reißfelder, Christoph |d 60000 |d 61800 |d 50000 |e 60000PR1025566211 |e 61800PR1025566211 |e 50000PR1025566211 |k 0/60000/ |k 1/60000/61800/ |k 0/50000/ |p 6 | ||
| 998 | |g 1029219907 |a Şandra-Petrescu, Flavius Ionuţ |m 1029219907:Şandra-Petrescu, Flavius Ionuţ |d 60000 |d 61800 |e 60000PS1029219907 |e 61800PS1029219907 |k 0/60000/ |k 1/60000/61800/ |p 5 | ||
| 998 | |g 1196385238 |a Hakenberg, Priska |m 1196385238:Hakenberg, Priska |d 60000 |e 60000PH1196385238 |k 0/60000/ |p 4 | ||
| 998 | |g 1373626186 |a Chu, Yuan |m 1373626186:Chu, Yuan |d 60000 |e 60000PC1373626186 |k 0/60000/ |p 2 | ||
| 999 | |a KXP-PPN1939897157 |e 4795689636 | ||
| BIB | |a Y | ||
| SER | |a journal | ||
| JSO | |a {"person":[{"family":"Li","display":"Li, Cheng-Peng","given":"Cheng-Peng","role":"aut"},{"display":"Chu, Yuan","family":"Chu","role":"aut","given":"Yuan"},{"display":"Jia, Wei-Wei","family":"Jia","role":"aut","given":"Wei-Wei"},{"role":"aut","given":"Priska","display":"Hakenberg, Priska","family":"Hakenberg"},{"role":"aut","given":"Flavius Ionuţ","display":"Şandra-Petrescu, Flavius Ionuţ","family":"Şandra-Petrescu"},{"display":"Reißfelder, Christoph","family":"Reißfelder","given":"Christoph","role":"aut"},{"given":"Cui","role":"aut","family":"Yang","display":"Yang, Cui"}],"language":["eng"],"type":{"bibl":"article-journal","media":"Online-Ressource"},"title":[{"title_sort":"Punching above its weight","subtitle":"a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions","title":"Punching above its weight"}],"note":["Gesehen am 03.11.2025"],"origin":[{"dateIssuedDisp":"2025-8-22","dateIssuedKey":"2025"}],"id":{"doi":["10.7150/ijms.118887"],"eki":["1939897157"]},"recId":"1939897157","relHost":[{"title":[{"title":"International journal of medical sciences","title_sort":"International journal of medical sciences"}],"note":["Gesehen am 01.10.2020"],"origin":[{"dateIssuedDisp":"2004-","publisher":"Ivyspring International Publ.","dateIssuedKey":"2004","publisherPlace":"Wyoming, NSW"}],"pubHistory":["1.2004 -"],"type":{"media":"Online-Ressource","bibl":"periodical"},"disp":"Punching above its weight a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questionsInternational journal of medical sciences","recId":"390964174","physDesc":[{"extent":"Online-Ressource"}],"id":{"eki":["390964174"],"zdb":["2151424-0"],"issn":["1449-1907"]},"language":["eng"],"part":{"year":"2025","pages":"3868-3877","issue":"15","text":"22(2025), 15, Seite 3868-3877","volume":"22","extent":"10"}}],"physDesc":[{"extent":"10 S.","noteIll":"Diagramme"}],"name":{"displayForm":["Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang"]}} | ||
| SRT | |a LICHENGPENPUNCHINGAB2025 | ||