Unknown

Dataset Information

0

EmeraLD: rapid linkage disequilibrium estimation with massive datasets.


ABSTRACT:

Summary

Estimating linkage disequilibrium (LD) is essential for a wide range of summary statistics-based association methods for genome-wide association studies. Large genetic datasets, e.g. the TOPMed WGS project and UK Biobank, enable more accurate and comprehensive LD estimates, but increase the computational burden of LD estimation. Here, we describe emeraLD (Efficient Methods for Estimation and Random Access of LD), a computational tool that leverages sparsity and haplotype structure to estimate LD up to 2 orders of magnitude faster than current tools.

Availability and implementation

emeraLD is implemented in C++, and is open source under GPLv3. Source code and documentation are freely available at http://github.com/statgen/emeraLD.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Quick C 

PROVIDER: S-EPMC6298049 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7038669 | biostudies-literature
| S-EPMC3431250 | biostudies-literature
| S-EPMC1866700 | biostudies-literature
| S-EPMC5996054 | biostudies-literature
| S-EPMC5972415 | biostudies-other
| S-EPMC7418405 | biostudies-literature
| S-EPMC4125401 | biostudies-literature
| S-EPMC5806858 | biostudies-literature
| S-EPMC5400794 | biostudies-literature
| S-EPMC2795174 | biostudies-literature