Detect influential points of feature rankings


Bibliographic details
Main authors: Wang, Shuo (author), Lu, Junyan (author)
Document type: Article (Journal)
Language: English
Published: April 2025
In: Computational biology and chemistry
Year: 2025, Volume: 115, Pages: 1-20
DOI:10.1016/j.compbiolchem.2024.108339
Online access: Publisher, license required, full text: https://doi.org/10.1016/j.compbiolchem.2024.108339
Publisher, license required, full text: https://www.sciencedirect.com/science/article/pii/S147692712400327X
Statement of responsibility: Shuo Wang, Junyan Lu
Description
Summary: Background - Feature rankings are crucial in bioinformatics but can be distorted by influential points (IPs), which are often overlooked. This study investigates the impact of IPs on feature rankings and proposes an IP detection method. - Method - We use a leave-one-out approach to assess each case's influence on feature rankings by comparing rank changes after its removal. Rank changes are measured by a novel rank comparison method that uses adaptive top-prioritized weights, which adjust to the distribution of rank changes. Our IP detection method was evaluated on several public datasets. - Results - Our method identified potential IPs in several TCGA gene expression datasets, revealing that IPs can severely distort feature rankings. These rank changes can ultimately affect subsequent analyses such as pathway enrichment, suggesting that IP detection is necessary when deriving feature rankings. - Conclusions - IPs significantly impact feature rankings and subsequent analyses; routine IP detection is necessary yet underutilized. Our method is available in the R package findIPs.
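
The leave-one-out idea in the summary can be illustrated with a minimal Python sketch. It is not the findIPs implementation and does not use that package's API. Two parts are assumptions made only for illustration: features are ranked by the absolute difference in group means, and rank changes are weighted by a fixed 1/rank weight standing in for the adaptive top-prioritized weights described in the article. The function names and the toy data are hypothetical.

```python
from statistics import mean


def rank_features(X, y):
    """Rank features by absolute difference in group means (rank 1 = top).

    Assumption: this simple score stands in for whatever ranking
    criterion (e.g. a t-test) an actual analysis would use.
    """
    n_features = len(X[0])
    scores = []
    for j in range(n_features):
        group0 = [row[j] for row, label in zip(X, y) if label == 0]
        group1 = [row[j] for row, label in zip(X, y) if label == 1]
        scores.append(abs(mean(group1) - mean(group0)))
    order = sorted(range(n_features), key=lambda j: -scores[j])
    ranks = [0] * n_features
    for position, j in enumerate(order, start=1):
        ranks[j] = position
    return ranks


def weighted_rank_change(original, perturbed):
    """Sum of absolute rank changes, weighted by 1 / original rank.

    Assumption: a fixed top-prioritized weight, not the adaptive
    weights proposed in the article.
    """
    return sum(abs(o - p) / o for o, p in zip(original, perturbed))


def leave_one_out_influence(X, y):
    """Score each case by how much the ranking changes when it is removed."""
    original = rank_features(X, y)
    influence = []
    for i in range(len(X)):
        X_loo = X[:i] + X[i + 1:]
        y_loo = y[:i] + y[i + 1:]
        perturbed = rank_features(X_loo, y_loo)
        influence.append(weighted_rank_change(original, perturbed))
    return influence


# Hypothetical toy data: 6 cases, 3 features. The last case carries an
# extreme value in the second feature, which has no real group difference.
X = [
    [1.0, 1.0, 1.0],
    [1.2, 1.1, 1.0],
    [0.8, 0.9, 1.0],
    [3.0, 1.0, 1.5],
    [3.1, 1.1, 1.5],
    [2.9, 10.0, 1.5],
]
y = [0, 0, 0, 1, 1, 1]

influence = leave_one_out_influence(X, y)
most_influential = influence.index(max(influence))
print(most_influential, influence)
```

In this toy example only the last case changes the ranking when removed, so it receives the largest influence score. A real analysis would also need a rule for deciding how large a score must be before a case is flagged as an IP.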
Description: Viewed on 30.09.2025
Description: Online resource