Unknown

Dataset Information

0

Multilevel Functional Principal Component Analysis for High-Dimensional Data.


ABSTRACT: We propose fast and scalable statistical methods for the analysis of hundreds or thousands of high dimensional vectors observed at multiple visits. The proposed inferential methods do not require loading the entire data set at once in the computer memory and instead use only sequential access to data. This allows deployment of our methodology on low-resource computers where computations can be done in minutes on extremely large data sets. Our methods are motivated by and applied to a study where hundreds of subjects were scanned using Magnetic Resonance Imaging (MRI) at two visits roughly five years apart. The original data possesses over ten billion measurements. The approach can be applied to any type of study where data can be unfolded into a long vector including densely observed functions and images. Supplemental materials are provided with source code for simulations, some technical details and proofs, and additional imaging results of the brain study.

SUBMITTER: Zipunnikov V 

PROVIDER: S-EPMC4425352 | biostudies-other | 2011

REPOSITORIES: biostudies-other

altmetric image

Publications

Multilevel Functional Principal Component Analysis for High-Dimensional Data.

Zipunnikov Vadim V   Caffo Brian B   Yousem David M DM   Davatzikos Christos C   Schwartz Brian S BS   Crainiceanu Ciprian C  

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America 20110101 4


We propose fast and scalable statistical methods for the analysis of hundreds or thousands of high dimensional vectors observed at multiple visits. The proposed inferential methods do not require loading the entire data set at once in the computer memory and instead use only sequential access to data. This allows deployment of our methodology on low-resource computers where computations can be done in minutes on extremely large data sets. Our methods are motivated by and applied to a study where  ...[more]

Similar Datasets

| S-EPMC2835171 | biostudies-literature
| S-EPMC4032817 | biostudies-other
| S-EPMC7449232 | biostudies-literature
| S-EPMC7313548 | biostudies-literature
| S-EPMC3131008 | biostudies-literature
| S-EPMC4721272 | biostudies-literature
| S-EPMC4383722 | biostudies-literature
| S-EPMC4479975 | biostudies-literature
| S-EPMC5517364 | biostudies-literature
| S-EPMC4135472 | biostudies-literature