Unknown

Dataset Information

0

Targeted short read sequencing and assembly of re-arrangements and candidate gene loci provide megabase diplotypes.


ABSTRACT: The human genome is composed of two haplotypes, otherwise called diplotypes, which denote phased polymorphisms and structural variations (SVs) that are derived from both parents. Diplotypes place genetic variants in the context of cis-related variants from a diploid genome. As a result, they provide valuable information about hereditary transmission, context of SV, regulation of gene expression and other features which are informative for understanding human genetics. Successful diplotyping with short read whole genome sequencing generally requires either a large population or parent-child trio samples. To overcome these limitations, we developed a targeted sequencing method for generating megabase (Mb)-scale haplotypes with short reads. One selects specific 0.1-0.2 Mb high molecular weight DNA targets with custom-designed Cas9-guide RNA complexes followed by sequencing with barcoded linked reads. To test this approach, we designed three assays, targeting the BRCA1 gene, the entire 4-Mb major histocompatibility complex locus and 18 well-characterized SVs, respectively. Using an integrated alignment- and assembly-based approach, we generated comprehensive variant diplotypes spanning the entirety of the targeted loci and characterized SVs with exact breakpoints. Our results were comparable in quality to long read sequencing.

SUBMITTER: Shin G 

PROVIDER: S-EPMC6821272 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Targeted short read sequencing and assembly of re-arrangements and candidate gene loci provide megabase diplotypes.

Shin GiWon G   Greer Stephanie U SU   Xia Li C LC   Lee HoJoon H   Zhou Jun J   Boles T Christian TC   Ji Hanlee P HP  

Nucleic acids research 20191101 19


The human genome is composed of two haplotypes, otherwise called diplotypes, which denote phased polymorphisms and structural variations (SVs) that are derived from both parents. Diplotypes place genetic variants in the context of cis-related variants from a diploid genome. As a result, they provide valuable information about hereditary transmission, context of SV, regulation of gene expression and other features which are informative for understanding human genetics. Successful diplotyping with  ...[more]

Similar Datasets

| S-EPMC5701471 | biostudies-literature
| S-EPMC4622496 | biostudies-literature
| S-EPMC8575027 | biostudies-literature
| S-EPMC3276136 | biostudies-literature
2020-08-24 | GSE138137 | GEO
| S-EPMC3309356 | biostudies-literature
| S-EPMC3167803 | biostudies-literature
| S-EPMC3227110 | biostudies-literature
| S-EPMC2813482 | biostudies-literature
| S-EPMC3663818 | biostudies-literature