Dataset Information

A multiple hold-out framework for Sparse Partial Least Squares.

ABSTRACT: Supervised classification machine learning algorithms may have limitations when studying brain diseases with heterogeneous populations, as the labels might be unreliable. More exploratory approaches, such as Sparse Partial Least Squares (SPLS), may provide insights into the brain's mechanisms by finding relationships between neuroimaging and clinical/demographic data. The identification of these relationships has the potential to improve the current understanding of disease mechanisms, refine clinical assessment tools, and stratify patients. SPLS finds multivariate associative effects in the data by computing pairs of sparse weight vectors, where each pair is used to remove its corresponding associative effect from the data by matrix deflation, before computing additional pairs. We propose a novel SPLS framework which selects the adequate number of voxels and clinical variables to describe each associative effect, and tests their reliability by fitting the model to different splits of the data. As a proof of concept, the approach was applied to find associations between grey matter probability maps and individual items of the Mini-Mental State Examination (MMSE) in a clinical sample with various degrees of dementia. The framework found two statistically significant associative effects between subsets of brain voxels and subsets of the questions/tasks. SPLS was compared with its non-sparse version (PLS). The use of projection deflation versus a classical PLS deflation was also tested in both PLS and SPLS. SPLS outperformed PLS, finding statistically significant effects and providing higher correlation values in hold-out data. Moreover, projection deflation provided better results.
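
The abstract's description of SPLS (pairs of sparse weight vectors, each followed by matrix deflation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the `frac_u`/`frac_v` sparsity knobs, and the relative soft-threshold rule are assumptions made for illustration, and the hold-out splits and significance testing described in the abstract are not reproduced here. Only projection deflation is shown.

```python
import numpy as np


def soft_threshold(a, frac):
    """Shrink entries of a by frac * max|a| (hypothetical sparsity knob, 0 <= frac < 1)."""
    lam = frac * np.max(np.abs(a))
    return np.sign(a) * np.maximum(np.abs(a) - lam, 0.0)


def sparse_weight_pair(X, Y, frac_u=0.5, frac_v=0.5, n_iter=200, tol=1e-10):
    """Alternating soft-thresholded power iterations on the cross-product X^T Y."""
    if not (0.0 <= frac_u < 1.0 and 0.0 <= frac_v < 1.0):
        raise ValueError("frac_u and frac_v must lie in [0, 1)")
    C = X.T @ Y
    # Start from the leading right singular vector of C.
    v = np.linalg.svd(C, full_matrices=False)[2][0]
    u = np.zeros(X.shape[1])
    for _ in range(n_iter):
        u_new = soft_threshold(C @ v, frac_u)
        u_new /= np.linalg.norm(u_new)
        v_new = soft_threshold(C.T @ u_new, frac_v)
        v_new /= np.linalg.norm(v_new)
        change = np.linalg.norm(u_new - u) + np.linalg.norm(v_new - v)
        u, v = u_new, v_new
        if change < tol:
            break
    return u, v


def projection_deflation(X, u):
    """Remove the direction u (unit norm) from the rows of X: X - (X u) u^T."""
    return X - np.outer(X @ u, u)


def spls(X, Y, n_pairs=2, **kwargs):
    """Compute n_pairs sparse weight pairs, deflating both views after each pair."""
    X, Y = X.copy(), Y.copy()
    pairs = []
    for _ in range(n_pairs):
        u, v = sparse_weight_pair(X, Y, **kwargs)
        pairs.append((u, v))
        X = projection_deflation(X, u)
        Y = projection_deflation(Y, v)
    return pairs
```

In this sketch, `X` would hold one row per subject and one column per voxel, and `Y` one column per clinical item; both are assumed to be mean-centred beforehand. After projection deflation, the deflated `X` has zero projection onto `u`, so the next pair describes a different associative effect.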

SUBMITTER: Monteiro JM

PROVIDER: S-EPMC5012894 | biostudies-other | 2016 Sep

REPOSITORIES: biostudies-other

Publications

A multiple hold-out framework for Sparse Partial Least Squares.

João M Monteiro, Anil Rao, John Shawe-Taylor, Janaina Mourão-Miranda

Journal of Neuroscience Methods, 2016-06-26

PMID: 27353722
