Gene expression profiling of breast cancer survivability by pooled cDNA microarray analysis using logistic regression, artificial neural networks and decision trees

Microarray technology can acquire information about thousands of genes simultaneously. We analyzed published breast cancer microarray databases to predict five-year recurrence and compared the performance of three data mining algorithms of artificial neural networks (ANN), decision trees (DT) and lo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Chou, Hsiu-Ling (VerfasserIn) , Yao, Chung-Tay (VerfasserIn) , Su, Sui-Lun (VerfasserIn) , Lee, Chia-Yi (VerfasserIn) , Hu, Kuang-Yu (VerfasserIn) , Terng, Harn-Jing (VerfasserIn) , Shih, Yun-Wen (VerfasserIn) , Chang, Yu-Tien (VerfasserIn) , Lu, Yu-Fen (VerfasserIn) , Chang, Chi-Wen (VerfasserIn) , Wahlqvist, Mark L. (VerfasserIn) , Wetter, Thomas (VerfasserIn) , Chu, Chi-Ming (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: 19 March 2013
In: BMC bioinformatics
Year: 2013, Jahrgang: 14, Heft: 1, Pages: 1-11
ISSN:1471-2105
DOI:10.1186/1471-2105-14-100
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1186/1471-2105-14-100
Volltext
Verfasserangaben:Hsiu-Ling Chou, Chung-Tay Yao, Sui-Lun Su, Chia-Yi Lee, Kuang-Yu Hu, Harn-Jing Terng, Yun-Wen Shih, Yu-Tien Chang, Yu-Fen Lu, Chi-Wen Chang, Mark L. Wahlqvist, Thomas Wetter and Chi-Ming Chu
Beschreibung
Zusammenfassung:Microarray technology can acquire information about thousands of genes simultaneously. We analyzed published breast cancer microarray databases to predict five-year recurrence and compared the performance of three data mining algorithms of artificial neural networks (ANN), decision trees (DT) and logistic regression (LR) and two composite models of DT-ANN and DT-LR. The collection of microarray datasets from the Gene Expression Omnibus, four breast cancer datasets were pooled for predicting five-year breast cancer relapse. After data compilation, 757 subjects, 5 clinical variables and 13,452 genetic variables were aggregated. The bootstrap method, Mann-Whitney U test and 20-fold cross-validation were performed to investigate candidate genes with 100 most-significant p-values. The predictive powers of DT, LR and ANN models were assessed using accuracy and the area under ROC curve. The associated genes were evaluated using Cox regression.
Beschreibung:Gesehen am 16.04.2021
Beschreibung:Online Resource
ISSN:1471-2105
DOI:10.1186/1471-2105-14-100