Dataset Information

An accurate clone-based haplotyping method by overlapping pool sequencing.


ABSTRACT: Chromosome-long haplotyping of human genomes is important to identify genetic variants with differing gene expression, in human evolution studies, clinical diagnosis, and other biological and medical fields. Although several methods have realized haplotyping based on sequencing technologies or population statistics, accuracy and cost are factors that prohibit their wide use. Borrowing ideas from group testing theories, we proposed a clone-based haplotyping method by overlapping pool sequencing. The clones from a single individual were pooled combinatorially and then sequenced. According to the distinct pooling pattern for each clone in the overlapping pool sequencing, alleles for the recovered variants could be assigned to their original clones precisely. Subsequently, the clone sequences could be reconstructed by linking these alleles accordingly and assembling them into haplotypes with high accuracy. To verify the utility of our method, we constructed 130,110 clones in silico for the individual NA12878 and simulated the pooling and sequencing process. Ultimately, 99.9% of variants on chromosome 1 that were covered by clones from both parental chromosomes were recovered correctly, and 112 haplotype contigs were assembled with an N50 length of 3.4 Mb and no switch errors. A comparison with current clone-based haplotyping methods indicated our method was more accurate.
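
The pooling-and-decoding idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the choice of "k pools out of n" as the pooling pattern, and the example clones and alleles are all assumptions made for illustration. The sketch also assumes that no two clones carry the same allele at the same position; an observation whose pool set matches no single clone's pattern is simply left unassigned.

```python
from itertools import combinations
from math import comb


def assign_signatures(clone_ids, n_pools, k):
    """Give each clone a distinct set of k pools (hypothetical pooling design)."""
    if comb(n_pools, k) < len(clone_ids):
        raise ValueError("not enough distinct pool combinations for all clones")
    return {clone: frozenset(pools)
            for clone, pools in zip(clone_ids, combinations(range(n_pools), k))}


def pool_clones(clone_alleles, signatures, n_pools):
    """Simulate error-free sequencing: each pool reports the (position, allele)
    pairs of every clone placed in it."""
    pools = [set() for _ in range(n_pools)]
    for clone, alleles in clone_alleles.items():
        for p in signatures[clone]:
            pools[p].update(alleles.items())
    return pools


def decode(pools, signatures):
    """Assign each observed (position, allele) to the clone whose pooling
    pattern equals the set of pools in which that observation appears."""
    seen = {}
    for idx, pool in enumerate(pools):
        for obs in pool:
            seen.setdefault(obs, set()).add(idx)
    by_signature = {sig: clone for clone, sig in signatures.items()}
    recovered = {}
    for (pos, allele), pool_set in seen.items():
        clone = by_signature.get(frozenset(pool_set))
        if clone is not None:  # unmatched patterns stay unassigned
            recovered.setdefault(clone, {})[pos] = allele
    return recovered


# Made-up example: two overlapping clones from different parental
# chromosomes plus one clone elsewhere (positions and alleles are invented).
clone_alleles = {
    "cloneA": {100: "A", 250: "G"},
    "cloneB": {100: "C", 250: "T"},
    "cloneC": {900: "T"},
}
signatures = assign_signatures(list(clone_alleles), n_pools=4, k=2)
pools = pool_clones(clone_alleles, signatures, n_pools=4)
recovered = decode(pools, signatures)
```

In this toy setting every allele is traced back to its clone of origin, after which alleles on the same clone can be linked into a haplotype fragment. A realistic design would also need to handle sequencing errors, uneven coverage, and clones that share alleles, none of which this sketch attempts.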

SUBMITTER: Li C 

PROVIDER: S-EPMC4937318 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

Publications

An accurate clone-based haplotyping method by overlapping pool sequencing.

Li Cheng, Cao Changchang, Tu Jing, Sun Xiao

Nucleic Acids Research, 2016-04-19, issue 12


Similar Datasets

| S-EPMC4053695 | biostudies-literature
| S-EPMC403814 | biostudies-literature
| S-EPMC4229885 | biostudies-literature
| S-EPMC3397394 | biostudies-literature
| S-EPMC6214236 | biostudies-literature
| S-EPMC6612846 | biostudies-literature
| S-EPMC1919397 | biostudies-literature
| S-EPMC5013932 | biostudies-literature
| S-EPMC4162929 | biostudies-literature
| S-EPMC5870854 | biostudies-other