The art of diagnosing rare skin tumors: can DL-CNNs enhance dermatologists’ diagnostic accuracy?

Importance - Deep learning convolutional neural networks (DL-CNN) achieved diagnostic accuracies comparable to dermatologists in controlled test environments. However, their performance in diagnosing rare skin tumors (RST) remains unclear. This study aimed to evaluate a binary DL-CNN’s diagnostic pe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Vollmer, Anastasia S. (VerfasserIn) , Winkler, Julia K. (VerfasserIn) , Kommoss, Katharina (VerfasserIn) , Blum, A. (VerfasserIn) , Tschandl, P. (VerfasserIn) , Kränke, T. (VerfasserIn) , Hofmann-Wellenhof, E. (VerfasserIn) , Hofmann-Wellenhof, R. (VerfasserIn) , Stolz, W. (VerfasserIn) , Enk, Alexander (VerfasserIn) , Hänßle, Holger (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: 1 October 2025
In: European journal of cancer
Year: 2025, Jahrgang: 228, Pages: 1-8
ISSN:1879-0852
DOI:10.1016/j.ejca.2025.115751
Online-Zugang:Verlag, kostenfrei, Volltext: https://doi.org/10.1016/j.ejca.2025.115751
Verlag, kostenfrei, Volltext: https://www.sciencedirect.com/science/article/pii/S0959804925005362
Volltext
Verfasserangaben:A.S. Vollmer, J.K. Winkler, K. Kommoss, A. Blum, P. Tschandl, T. Kränke, E. Hofmann-Wellenhof, R. Hofmann-Wellenhof, W. Stolz, A. Enk, H.A. Haenssle
Beschreibung
Zusammenfassung:Importance - Deep learning convolutional neural networks (DL-CNN) achieved diagnostic accuracies comparable to dermatologists in controlled test environments. However, their performance in diagnosing rare skin tumors (RST) remains unclear. This study aimed to evaluate a binary DL-CNN’s diagnostic performance in RST and assess the level of support for an international group of dermatologists. - Methods - In a cross-sectional reader study, a market-approved binary DL-CNN (Moleanalyzer-Pro) assessed 200 dermoscopic images in a conveniance sample of histologically confirmed RST. An international panel of dermatologists rated malignancy and management across three levels: (I) dermoscopy only, (II) dermoscopy, close-up images, and metadata, (III) level-II plus DL-CNN malignancy predictions. Sensitivity, specificity, and the area under the receiver operating characteristic curve (ROC-AUC) for the DL-CNN versus dermatologists (level-II). Secondary outcomes included performance changes across study levels. - Results - The DL-CNN achieved a sensitivity (95% CI) of 66.7% (56.4%-75.6%), specificity of 56.4% (47.0%-65.3%), and ROC-AUC of 0.634 (0.557-0.711). Dermatologists reached a significantly higher mean sensitivity (80.3%, 77.3%-83.4%), specificity (65.1%, 61.3%-69.0%), and ROC-AUC (0.839, 0.783-0.894; all p<0.001). With DL-CNN predictions, dermatologists’ sensitivity slightly increased (81.3%, p=0.032), specificity decreased (64.0%, p=0.036), and ROC-AUC remained unchanged. The DL-CNN could not improve dermatologists’ accuracy in misclassified cases. - Conclusion - The tested DL-CNN showed a limited diagnostic performance in diagnosing RST. While minor effects on expert decision-making were observed, overall diagnostic accuracy remained highest with full clinical context. Better training data are needed for improved DL-CNN performance in RST.
Beschreibung:Available online 29 August 2025, Version of Record 4 September 2025
Gesehen am 23.01.2026
Beschreibung:Online Resource
ISSN:1879-0852
DOI:10.1016/j.ejca.2025.115751