Genomics

Dataset Information

0

Long-read trio sequencing of unsolved patients with intellectual disability


ABSTRACT: Long-read sequencing (LRS) has the potential to comprehensively identify all medically-relevant genome variation, including variation commonly missed by short read sequencing (SRS) approaches. To determine this potential, we performed LRS up to 40x coverage using the Pacific Biosciences Sequel instrument for 5 patient-parent trios with intellectual disability, whose aetiology remained unresolved after SRS exomes and genomes. A first systematic assessment of LRS coverage showed that ~35Mb of the genome was only accessible by LRS and not SRS. Structural variant (SV) calling yielded on average 28,292 SV calls per individual, totalling 12.9 MB of sequence. Trio-based analyses showed concordance for up to 95% of these SV calls across the genome, and 80% of the LRS SV calls were not identified by SRS. De novo mutation analysis did not identify any de novo SVs, confirming these are rare events. Because of high sequence coverage, we were also able to call single nucleotide substitutions. On average, we identified 3 million substitutions per genome, with a Mendelian inheritance concordance of up to 97%. Of these, approximately 100,000 were located in the ~35Mb of the genome that was only captured by LRS. Moreover, these variants affected the coding sequence of 64 genes, including 32 known to cause Mendelian disorders. Although no clinical diagnoses were made in these 5 patients, our data show the potential added value of LRS compared to SRS for identifying medically relevant genome variation.

PROVIDER: EGAS00001004319 | EGA |

REPOSITORIES: EGA

Similar Datasets

2017-02-10 | GSE94107 | GEO
2010-02-01 | GSE18542 | GEO
2013-04-01 | GSE41160 | GEO
2019-01-17 | GSE125209 | GEO
2019-01-17 | GSE125208 | GEO
2019-01-17 | GSE125207 | GEO
2013-02-12 | E-GEOD-20289 | biostudies-arrayexpress
2010-02-01 | E-GEOD-18542 | biostudies-arrayexpress
2018-06-26 | GSE105092 | GEO
2012-10-10 | GSE36132 | GEO