Unknown

Dataset Information

0

A Fast and Simple Method for Detecting Identity-by-Descent Segments in Large-Scale Data.


ABSTRACT: Segments of identity by descent (IBD) are used in many genetic analyses. We present a method for detecting identical-by-descent haplotype segments in phased genotype data. Our method, called hap-IBD, combines a compressed representation of haplotype data, the positional Burrows-Wheeler transform, and multi-threaded execution to produce very fast analysis times. An attractive feature of hap-IBD is its simplicity: the input parameters clearly and precisely define the IBD segments that are reported, so that program correctness can be confirmed by users. We evaluate hap-IBD and four state-of-the-art IBD segment detection methods (GERMLINE, iLASH, RaPID, and TRUFFLE) using UK Biobank chromosome 20 data and simulated sequence data. We show that hap-IBD detects IBD segments faster and more accurately than competing methods, and that hap-IBD is the only method that can rapidly and accurately detect short 2-4 centiMorgan (cM) IBD segments in the full UK Biobank data. Analysis of 485,346 UK Biobank samples through the use of hap-IBD with 12 computational threads detects 231.5 billion autosomal IBD segments with length ?2 cM in 24.4 h.

SUBMITTER: Zhou Y 

PROVIDER: S-EPMC7118582 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Fast and Simple Method for Detecting Identity-by-Descent Segments in Large-Scale Data.

Zhou Ying Y   Browning Sharon R SR   Browning Brian L BL  

American journal of human genetics 20200312 4


Segments of identity by descent (IBD) are used in many genetic analyses. We present a method for detecting identical-by-descent haplotype segments in phased genotype data. Our method, called hap-IBD, combines a compressed representation of haplotype data, the positional Burrows-Wheeler transform, and multi-threaded execution to produce very fast analysis times. An attractive feature of hap-IBD is its simplicity: the input parameters clearly and precisely define the IBD segments that are reported  ...[more]

Similar Datasets

| S-EPMC3035716 | biostudies-literature
| S-EPMC4315301 | biostudies-literature
| S-EPMC7750976 | biostudies-literature
| S-EPMC3905877 | biostudies-other
| S-EPMC3824133 | biostudies-literature
| S-EPMC4104314 | biostudies-literature
| S-EPMC7332656 | biostudies-literature
| S-EPMC6659282 | biostudies-literature
| S-EPMC4856080 | biostudies-other
| S-EPMC8631639 | biostudies-literature