SwarmMAP: swarm learning for decentralized cell type annotation in single cell sequencing data

Bibliographic Details
Main Authors: Saldanha, Oliver Lester (Author); Goepp, Vivien (Author); Pfeiffer, Kevin (Author); Kim, Hyojin (Author); Zhu, Jie Fu (Author); Kramann, Rafael (Author); Hayat, Sikander (Author); Kather, Jakob Nikolas (Author)
Format: Article (Journal)
Language: English
Published: 18 February 2026
In: npj Systems biology and applications
Year: 2026, Volume: 12, Pages: 1-12
ISSN: 2056-7189
DOI: 10.1038/s41540-026-00667-6
Online Access: Publisher, free of charge, full text: https://doi.org/10.1038/s41540-026-00667-6
Publisher, free of charge, full text: https://www.nature.com/articles/s41540-026-00667-6
Author Notes: Oliver Lester Saldanha, Vivien Goepp, Kevin Pfeiffer, Hyojin Kim, Jie Fu Zhu, Rafael Kramann, Sikander Hayat & Jakob Nikolas Kather
Description
Summary: Rapid technological progress now enables large-scale generation of single-cell data. Many laboratories can produce single-cell transcriptomic profiles from diverse tissues. A key step in single-cell analysis is unsupervised clustering followed by cell-type annotation, yet there is no agreement on marker genes, and annotation is typically done manually, making it irreproducible and poorly scalable. Privacy constraints in human datasets further complicate data sharing. There is a need for standardized, automated, and privacy-preserving cell-type annotation across datasets. We developed SwarmMAP, which applies Swarm Learning to train machine-learning models for cell-type classification in a decentralized setting without exchanging raw data between centers. SwarmMAP achieves F1-scores of 0.93, 0.98, and 0.88 in heart, lung, and breast datasets, respectively. Swarm Learning models reach an average performance of 0.907, comparable to models trained on centralized data (p-val = 0.937, Mann-Whitney U Test). Increasing the number of datasets improves prediction accuracy and supports classification across broader cell-type diversity. These results show that Swarm Learning provides an effective approach for automated cell-type annotation. SwarmMAP is available at https://github.com/hayatlab/SwarmMAP.
Item Description: Published: 18 February 2026
Accessed on 30 April 2026
Physical Description: Online Resource