Comparative evaluation of machine learning strategies for analyzing big data in psychiatry

The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about al...

Full description

Saved in:
Bibliographic Details
Main Authors: Cao, Han (Author) , Meyer-Lindenberg, Andreas (Author) , Schwarz, Emanuel (Author)
Format: Article (Journal)
Language:English
Published: 29 October 2018
In: International journal of molecular sciences
Year: 2018, Volume: 19, Issue: 11
ISSN:1422-0067
DOI:10.3390/ijms19113387
Online Access:Verlag, Volltext: http://dx.doi.org/10.3390/ijms19113387
Verlag, Volltext: https://www.mdpi.com/1422-0067/19/11/3387
Get full text
Author Notes:Han Cao, Andreas Meyer-Lindenberg, Emanuel Schwarz (Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University)
Description
Summary:The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry.
Item Description:Gesehen am 15.03.2019
Physical Description:Online Resource
ISSN:1422-0067
DOI:10.3390/ijms19113387