Exploring the potential of AI-powered applications for clinical decision-making in gynecologic oncology
Objective The rise of artificial intelligence (AI) and large language models like Llama, Gemini, or Generative Pretraining Transformer (GPT) signals a promising new era in natural language processing and has significant potential for application in medical care. This study seeks to investigate the p...
Saved in:
| Main Authors: | , , , , , , , , |
|---|---|
| Format: | Article (Journal) |
| Language: | English |
| Published: |
2025
|
| In: |
International journal of gynecology & obstetrics
Year: 2025, Pages: 1-7 |
| ISSN: | 1879-3479 |
| DOI: | 10.1002/ijgo.70251 |
| Online Access: | Verlag, kostenfrei, Volltext: https://doi.org/10.1002/ijgo.70251 Verlag, kostenfrei, Volltext: https://onlinelibrary.wiley.com/doi/abs/10.1002/ijgo.70251 |
| Author Notes: | Bastian Meyer, Raphael Kfuri-Rubens, Georg Schmidt, Maliha Tariq, Caroline Riedel, Florian Recker, Fabian Riedel, Marion Kiechle, Maximilian Riedel |
| Summary: | Objective The rise of artificial intelligence (AI) and large language models like Llama, Gemini, or Generative Pretraining Transformer (GPT) signals a promising new era in natural language processing and has significant potential for application in medical care. This study seeks to investigate the potential of GPT-4 for automated therapy recommendations by examining individual patient health record data with a focus on gynecologic malignancies and breast cancer. Methods We tasked GPT-4 with generating independent treatment proposals for 60 randomly selected patient cases presented at gynecologic and senologic multidisciplinary tumor boards (MDTs). The treatment recommendations by GPT-4 were compared with those of the MDTs using a novel clinical concordance score and were reviewed both qualitatively and quantitatively by experienced gynecologic oncologists. Results GPT-4 generated coherent therapeutic recommendations for all clinical cases. Overall, these recommendations were assessed by clinical experts as moderately sufficient for real-word clinical application. Deficiencies in both accuracy and completeness were especially noted. Using a quantitative clinical concordance score, GPT-4 consistently demonstrated superior performance in managing the senologic cases compared with the gynecologic cases. Iterative prompting substantially enhanced treatment recommendations in both categories, increasing concordance with MDT decisions to up to 84% in senologic cases. Conclusion GPT-4 is capable of processing complex patient cases and generates detailed treatment recommendations; however, differences persist in surgical approaches and the use of systemic therapies, and there is a tendency toward recommending excessive genetic testing. As AI-powered solutions continue to be integrated into medicine, we envision the potential for automated therapy recommendations to play a supportive role in human clinical decision-making in the future. |
|---|---|
| Item Description: | Gesehen am 20.10.2025 |
| Physical Description: | Online Resource |
| ISSN: | 1879-3479 |
| DOI: | 10.1002/ijgo.70251 |