Unknown

Dataset Information

0

ROC-supervised principal component analysis in connection with the diagnosis of diseases.


ABSTRACT: Principal component analysis (PCA) is a data analysis method that can deal with large volumes of data. Owing to the complexity and volume of the data generated by today's advanced technologies in genomics, pro-teomics, and metabolomics, PCA has become predominant in the medical sciences. Despite its popularity, PCA leaves much to be desired in terms of accuracy and may not be suitable for certain medical applications, such as diagnostics, where accuracy is paramount. In this study, we introduced a new PCA method, one that is carefully supervised by receiver operating characteristic (ROC) curve analysis. In order to assess its performance with respect to its ability to render an accurate differential diagnosis, and to compare its performance with that of standard PCA, we studied the striatal metabolomic profile of R6/2 Huntington disease (HD) transgenic mice, as well as that of wild type (WT) mice, using high field in vivo proton nuclear magnetic resonance (NMR) spectroscopy (9.4-Tesla). We tested both the standard PCA and our ROC-supervised PCA (using in each case both the covariance and the correlation matrix), 1) with the original R6/2 HD mice and WT mice, 2) with unknown mice, whose status had been determined via genotyping, and 3) with the ability to separate the original R6/2 mice into the two age subgroups (8 and 12 wks old). Only our ROC-supervised PCA (both with the covariance and the correlation matrix) passed all tests with a total accuracy of 100%; thus, providing evidence that it may be used for diagnostic purposes.

SUBMITTER: Nikas JB 

PROVIDER: S-EPMC3056564 | biostudies-literature | 2011 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

ROC-supervised principal component analysis in connection with the diagnosis of diseases.

Nikas Jason B JB   Low Walter C WC  

American journal of translational research 20110203 2


Principal component analysis (PCA) is a data analysis method that can deal with large volumes of data. Owing to the complexity and volume of the data generated by today's advanced technologies in genomics, pro-teomics, and metabolomics, PCA has become predominant in the medical sciences. Despite its popularity, PCA leaves much to be desired in terms of accuracy and may not be suitable for certain medical applications, such as diagnostics, where accuracy is paramount. In this study, we introduced  ...[more]

Similar Datasets

| S-EPMC7274418 | biostudies-literature
| S-EPMC4046680 | biostudies-literature
2019-02-26 | GSE120584 | GEO
| S-EPMC3775684 | biostudies-literature
2011-08-15 | GSE31375 | GEO
| S-EPMC2835171 | biostudies-literature
| S-EPMC3131008 | biostudies-literature