Unknown

Dataset Information

0

A new tool called DISSECT for analysing large genomic data sets using a Big Data approach.


ABSTRACT: Large-scale genetic and genomic data are increasingly available and the major bottleneck in their analysis is a lack of sufficiently scalable computational tools. To address this problem in the context of complex traits analysis, we present DISSECT. DISSECT is a new and freely available software that is able to exploit the distributed-memory parallel computational architectures of compute clusters, to perform a wide range of genomic and epidemiologic analyses, which currently can only be carried out on reduced sample sizes or under restricted conditions. We demonstrate the usefulness of our new tool by addressing the challenge of predicting phenotypes from genotype data in human populations using mixed-linear model analysis. We analyse simulated traits from 470,000 individuals genotyped for 590,004 SNPs in ?4?h using the combined computational power of 8,400 processor cores. We find that prediction accuracies in excess of 80% of the theoretical maximum could be achieved with large sample sizes.

SUBMITTER: Canela-Xandri O 

PROVIDER: S-EPMC4682108 | biostudies-literature | 2015 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

A new tool called DISSECT for analysing large genomic data sets using a Big Data approach.

Canela-Xandri Oriol O   Law Andy A   Gray Alan A   Woolliams John A JA   Tenesa Albert A  

Nature communications 20151211


Large-scale genetic and genomic data are increasingly available and the major bottleneck in their analysis is a lack of sufficiently scalable computational tools. To address this problem in the context of complex traits analysis, we present DISSECT. DISSECT is a new and freely available software that is able to exploit the distributed-memory parallel computational architectures of compute clusters, to perform a wide range of genomic and epidemiologic analyses, which currently can only be carried  ...[more]

Similar Datasets

| S-EPMC2375126 | biostudies-literature
| S-EPMC6849186 | biostudies-literature
| S-EPMC8280138 | biostudies-literature
| S-EPMC2573859 | biostudies-literature
| S-EPMC5809091 | biostudies-literature
| S-EPMC6886512 | biostudies-literature
| S-EPMC7538337 | biostudies-literature
| S-EPMC7816668 | biostudies-literature
| S-EPMC5343946 | biostudies-literature
| S-EPMC4563723 | biostudies-literature