Interpretable type 2 diabetes incidence prediction with AutoScore: A model based on standard clinical parameters
Objective - Accurate prediction of type 2 diabetes mellitus (T2DM) onset is critical to enable timely interventions and preventive strategies. Although machine learning (ML) approaches have shown promise in risk prediction, their complexity often limits clinical implementation. There is a need for i...
Saved in:
| Main Authors: | , , , , , , , , , , |
|---|---|
| Format: | Article (Journal) |
| Language: | English |
| Published: |
February 2026
|
| In: |
International journal of medical informatics
Year: 2026, Volume: 206, Pages: 1-8 |
| ISSN: | 1872-8243 |
| DOI: | 10.1016/j.ijmedinf.2025.106161 |
| Online Access: | Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1016/j.ijmedinf.2025.106161 Verlag, lizenzpflichtig, Volltext: https://www.sciencedirect.com/science/article/pii/S1386505625003788 |
| Author Notes: | Andreas Leiherer, Laura Schnetzer, Sylvia Mink, Arthur Mader, Axel Mündlein, Bernhard Bermeitinger, Angela P. Moissl-Blanke, Winfried März, Angelika Hammerer-Lercher, Marcus E. Kleber, Heinz Drexel |
| Summary: | Objective - Accurate prediction of type 2 diabetes mellitus (T2DM) onset is critical to enable timely interventions and preventive strategies. Although machine learning (ML) approaches have shown promise in risk prediction, their complexity often limits clinical implementation. There is a need for interpretable, user-friendly models that retain predictive strength. - Methods - We studied 904 cardiovascular risk patients without T2DM at baseline, assessing 71 anthropometric, clinical, and laboratory variables. Over a four-year follow-up, 10 % developed T2DM. We applied AutoScore, an interpretable ML framework that generates parsimonious, point-based risk scores, and compared its performance with an optimized Support Vector Machine (SVM) with a linear kernel. The SVM was refined using feature selection, Tomek link removal, and up-sampling to address class imbalance. - Results - Both approaches consistently identified fasting glucose, OGTT glucose, and the Matsuda index (reflecting glucose-insulin dynamics) as key predictors. The optimized SVM model achieved a higher balanced accuracy (75 % vs. 67 %), specificity (80 % vs. 77 %), and AUC (0.72 vs. 0.69) compared to AutoScore. However, AutoScore, other than the SVM model, relied exclusively on a small set of routinely available accessible parameters and thereby offered superior interpretability and ease of integration into clinical workflows. External validation in an independent cohort further confirmed the robustness of the AutoScore model. - Conclusion - Although black-box models such as SVM deliver slightly higher predictive accuracy, interpretable frameworks like AutoScore provide clinically actionable risk stratification based on standard data. Their transparency and simplicity make them particularly valuable for real-world decision support. |
|---|---|
| Item Description: | Online verfügbar: 18. Oktober 2025, Artikelversion: 1. November 2025 Gesehen am 12.11.2025 |
| Physical Description: | Online Resource |
| ISSN: | 1872-8243 |
| DOI: | 10.1016/j.ijmedinf.2025.106161 |