Ontology highlight
ABSTRACT:
SUBMITTER: Abraham G
PROVIDER: S-EPMC3981753 | biostudies-literature | 2014
REPOSITORIES: biostudies-literature
Abraham Gad G Inouye Michael M
PloS one 20140409 4
Principal component analysis (PCA) is routinely used to analyze genome-wide single-nucleotide polymorphism (SNP) data, for detecting population structure and potential outliers. However, the size of SNP datasets has increased immensely in recent years and PCA of large datasets has become a time consuming task. We have developed flashpca, a highly efficient PCA implementation based on randomized algorithms, which delivers identical accuracy in extracting the top principal components compared with ...[more]