Punching above its weight: a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions

Objective: This study aimed to compare the performance of DeepSeek-R1 and OpenAI-o1 in addressing complex pancreatic ductal adenocarcinoma (PDAC)-related clinical questions, focusing on accuracy, comprehensiveness, safety, and reasoning quality. Methods: Twenty PDAC-related questions derived from th...

Bibliographic details
Main authors: Li, Cheng-Peng (author), Chu, Yuan (author), Jia, Wei-Wei (author), Hakenberg, Priska (author), Şandra-Petrescu, Flavius Ionuţ (author), Reißfelder, Christoph (author), Yang, Cui (author)
Document type: Article (Journal)
Language: English
Published: 2025-8-22
In: International journal of medical sciences
Year: 2025, Volume: 22, Issue: 15, Pages: 3868-3877
ISSN: 1449-1907
DOI: 10.7150/ijms.118887
Online access: Publisher, free of charge, full text: https://doi.org/10.7150/ijms.118887
Publisher, free of charge, full text: https://www.medsci.org/v22p3868.htm
Statement of responsibility: Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang

MARC

LEADER 00000naa a2200000 c 4500
001 1939897157
003 DE-627
005 20251103105811.0
007 cr uuu---uuuuu
008 251103s2025 xx |||||o 00| ||eng c
024 7 |a 10.7150/ijms.118887  |2 doi 
035 |a (DE-627)1939897157 
035 |a (DE-599)KXP1939897157 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 33  |2 sdnb 
100 1 |a Li, Cheng-Peng  |e VerfasserIn  |0 (DE-588)1363080296  |0 (DE-627)1922902276  |4 aut 
245 1 0 |a Punching above its weight  |b a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions  |c Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang 
264 1 |c 2025-8-22 
300 |b Diagramme 
300 |a 10 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Gesehen am 03.11.2025 
520 |a Objective: This study aimed to compare the performance of DeepSeek-R1 and OpenAI-o1 in addressing complex pancreatic ductal adenocarcinoma (PDAC)-related clinical questions, focusing on accuracy, comprehensiveness, safety, and reasoning quality. Methods: Twenty PDAC-related questions derived from the up-to-date NCCN guidelines for PDAC were posed to both models. Responses were evaluated for accuracy, comprehensiveness, and safety, and chain-of-thought (CoT) outputs were rated for logical coherence and error handling by blinded clinical experts using 5-point Likert scales. Inter-rater reliability, evaluated scores, and character counts by both models were compared. Results: Both models demonstrated high accuracy (median score: 5 vs. 5, p=0.527) and safety (5 vs. 5, p=0.285). DeepSeek-R1 outperformed OpenAI-o1 in comprehensiveness (median: 5 vs. 4.5, p=0.015) and generated significantly longer responses (median characters: 544 vs. 248, p<0.001). For reasoning quality, DeepSeek-R1 achieved superior scores in logical coherence (median: 5 vs. 4, p<0.001) and error handling (5 vs. 4, p<0.001), with 75% of its responses scoring full points compared to OpenAI-o1's 5%. Conclusion: While both models exhibit high clinical utility, DeepSeek-R1's enhanced reasoning capabilities, open-source nature, and cost-effectiveness position it as a promising tool for complex oncology decision support. Further validation in real-world multimodal clinical scenarios is warranted. 
700 1 |a Chu, Yuan  |d 1994-  |e VerfasserIn  |0 (DE-588)1373626186  |0 (DE-627)1932986146  |4 aut 
700 1 |a Jia, Wei-Wei  |e VerfasserIn  |4 aut 
700 1 |a Hakenberg, Priska  |d 1990-  |e VerfasserIn  |0 (DE-588)1196385238  |0 (DE-627)1678166952  |4 aut 
700 1 |a Şandra-Petrescu, Flavius Ionuţ  |d 1977-  |e VerfasserIn  |0 (DE-588)1029219907  |0 (DE-627)732605490  |0 (DE-576)377147753  |4 aut 
700 1 |a Reißfelder, Christoph  |d 1975-  |e VerfasserIn  |0 (DE-588)1025566211  |0 (DE-627)722940297  |0 (DE-576)370516044  |4 aut 
700 1 |a Yang, Cui  |d 1984-  |e VerfasserIn  |0 (DE-588)1136151982  |0 (DE-627)891949968  |0 (DE-576)490363180  |4 aut 
773 0 8 |i Enthalten in  |t International journal of medical sciences  |d Wyoming, NSW : Ivyspring International Publ., 2004  |g 22(2025), 15, Seite 3868-3877  |h Online-Ressource  |w (DE-627)390964174  |w (DE-600)2151424-0  |w (DE-576)281254109  |x 1449-1907  |7 nnas  |a Punching above its weight a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions 
773 1 8 |g volume:22  |g year:2025  |g number:15  |g pages:3868-3877  |g extent:10  |a Punching above its weight a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions 
856 4 0 |u https://doi.org/10.7150/ijms.118887  |x Verlag  |x Resolving-System  |z kostenfrei  |3 Volltext 
856 4 0 |u https://www.medsci.org/v22p3868.htm  |x Verlag  |z kostenfrei  |3 Volltext 
951 |a AR 
992 |a 20251103 
993 |a Article 
994 |a 2025 
998 |g 1136151982  |a Yang, Cui  |m 1136151982:Yang, Cui  |d 60000  |d 61800  |e 60000PY1136151982  |e 61800PY1136151982  |k 0/60000/  |k 1/60000/61800/  |p 7  |y j 
998 |g 1025566211  |a Reißfelder, Christoph  |m 1025566211:Reißfelder, Christoph  |d 60000  |d 61800  |d 50000  |e 60000PR1025566211  |e 61800PR1025566211  |e 50000PR1025566211  |k 0/60000/  |k 1/60000/61800/  |k 0/50000/  |p 6 
998 |g 1029219907  |a Şandra-Petrescu, Flavius Ionuţ  |m 1029219907:Şandra-Petrescu, Flavius Ionuţ  |d 60000  |d 61800  |e 60000PS1029219907  |e 61800PS1029219907  |k 0/60000/  |k 1/60000/61800/  |p 5 
998 |g 1196385238  |a Hakenberg, Priska  |m 1196385238:Hakenberg, Priska  |d 60000  |e 60000PH1196385238  |k 0/60000/  |p 4 
998 |g 1373626186  |a Chu, Yuan  |m 1373626186:Chu, Yuan  |d 60000  |e 60000PC1373626186  |k 0/60000/  |p 2 
999 |a KXP-PPN1939897157  |e 4795689636 
BIB |a Y 
SER |a journal 
JSO |a {"person":[{"family":"Li","display":"Li, Cheng-Peng","given":"Cheng-Peng","role":"aut"},{"display":"Chu, Yuan","family":"Chu","role":"aut","given":"Yuan"},{"display":"Jia, Wei-Wei","family":"Jia","role":"aut","given":"Wei-Wei"},{"role":"aut","given":"Priska","display":"Hakenberg, Priska","family":"Hakenberg"},{"role":"aut","given":"Flavius Ionuţ","display":"Şandra-Petrescu, Flavius Ionuţ","family":"Şandra-Petrescu"},{"display":"Reißfelder, Christoph","family":"Reißfelder","given":"Christoph","role":"aut"},{"given":"Cui","role":"aut","family":"Yang","display":"Yang, Cui"}],"language":["eng"],"type":{"bibl":"article-journal","media":"Online-Ressource"},"title":[{"title_sort":"Punching above its weight","subtitle":"a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questions","title":"Punching above its weight"}],"note":["Gesehen am 03.11.2025"],"origin":[{"dateIssuedDisp":"2025-8-22","dateIssuedKey":"2025"}],"id":{"doi":["10.7150/ijms.118887"],"eki":["1939897157"]},"recId":"1939897157","relHost":[{"title":[{"title":"International journal of medical sciences","title_sort":"International journal of medical sciences"}],"note":["Gesehen am 01.10.2020"],"origin":[{"dateIssuedDisp":"2004-","publisher":"Ivyspring International Publ.","dateIssuedKey":"2004","publisherPlace":"Wyoming, NSW"}],"pubHistory":["1.2004 -"],"type":{"media":"Online-Ressource","bibl":"periodical"},"disp":"Punching above its weight a head-to-head comparison of deepseek-R1 and OpenAI-o1 on pancreatic adenocarcinoma-related questionsInternational journal of medical sciences","recId":"390964174","physDesc":[{"extent":"Online-Ressource"}],"id":{"eki":["390964174"],"zdb":["2151424-0"],"issn":["1449-1907"]},"language":["eng"],"part":{"year":"2025","pages":"3868-3877","issue":"15","text":"22(2025), 15, Seite 3868-3877","volume":"22","extent":"10"}}],"physDesc":[{"extent":"10 S.","noteIll":"Diagramme"}],"name":{"displayForm":["Cheng-Peng Li, Yuan Chu, Wei-Wei Jia, Priska Hakenberg, Flavius Șandra-Petrescu, Christoph Reißfelder, Cui Yang"]}}
SRT |a LICHENGPENPUNCHINGAB2025