Deep learning predicts microsatellite instability status in colorectal carcinoma in an ethnically heterogeneous population in South Africa

Background Deep learning (DL) models are effective pre-screening tools for detecting mismatch repair deficiency (dMMR) in colorectal carcinoma (CRC). These models have been trained and validated on large cohorts from the Northern Hemisphere, without representation of African samples. We sought to de...

Full description

Saved in:
Bibliographic Details
Main Authors: Aldera, Alessandro Pietro (Author) , Cifci, Didem (Author) , Veldhuizen, Gregory Patrick (Author) , Tsai, Wan-Jung (Author) , Pillay, Komala (Author) , Boutall, Adam (Author) , Brenner, Hermann (Author) , Hoffmeister, Michael (Author) , Kather, Jakob Nikolas (Author) , Ramesar, Raj (Author)
Format: Article (Journal)
Language:English
Published: 20 May 2025
In: Journal of clinical pathology
Year: 2025, Volume: 79, Issue: 1, Pages: 50-56
ISSN:1472-4146
DOI:10.1136/jcp-2025-210053
Online Access:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1136/jcp-2025-210053
Verlag, lizenzpflichtig, Volltext: https://jcp.bmj.com/content/early/2025/05/20/jcp-2025-210053
Get full text
Author Notes:Alessandro Pietro Aldera, Didem Cifci, Gregory Patrick Veldhuizen, Wan-Jung Tsai, Komala Pillay, Adam Boutall, Hermann Brenner, Michael Hoffmeister, Jakob Nikolas Kather, Raj Ramesar
Description
Summary:Background Deep learning (DL) models are effective pre-screening tools for detecting mismatch repair deficiency (dMMR) in colorectal carcinoma (CRC). These models have been trained and validated on large cohorts from the Northern Hemisphere, without representation of African samples. We sought to determine the performance of a DL model in an ethnically heterogeneous cohort of patients from South Africa. - Methods Our cohort comprised 197 CRC resection specimens, with scanned whole slide images tessellated and inputted into a transformer-based DL model trained on large international cohorts. Model performance was evaluated using area under the receiver operating characteristic curve (AUROC), sensitivity and specificity. The maximal Youden’s J index was calculated to determine the optimal cut-off threshold for the model prediction score. - Results Our model yielded an AUROC of 0.91 (±0.05). Using a prediction score threshold of 0.620 produced an overall sensitivity of 85.7% (95% CI 73.3% to 92.9%) and a specificity of 82.4% (95% CI 75.5% to 87.7%). The false negative cases were predominantly left-sided (71.4%) and did not show the typical dMMR/microsatellite instability-high histological phenotype. Sensitivity was lower (50%-75%) in cases showing isolated PMS2 or MSH6 loss of staining. Calibrating the classification threshold to 0.470, the sensitivity was optimised to 95.6% (95% CI 86.3% to 98.9%) with a specificity of 69.6% (95% CI 61.8% to 76.4%). This would have resulted in excluding 103 cases (52.3%) from downstream immunohistochemical (IHC) or molecular testing. - Conclusions Following appropriate region-specific calibration, we have shown that this model could be employed to accurately prescreen for dMMR in CRC, thereby reducing the burden of downstream IHC and molecular testing in a resource-limited setting.
Item Description:Gesehen am 06.11.2025
Physical Description:Online Resource
ISSN:1472-4146
DOI:10.1136/jcp-2025-210053