Unknown

Dataset Information

0

Precise detection of de novo single nucleotide variants in human genomes.


ABSTRACT: The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use.

SUBMITTER: Gomez-Romero L 

PROVIDER: S-EPMC6003530 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Precise detection of de novo single nucleotide variants in human genomes.

Gómez-Romero Laura L   Palacios-Flores Kim K   Reyes José J   García Delfino D   Boege Margareta M   Dávila Guillermo G   Flores Margarita M   Schatz Michael C MC   Palacios Rafael R  

Proceedings of the National Academy of Sciences of the United States of America 20180507 21


The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we de  ...[more]

Similar Datasets

2019-07-01 | GSE122954 | GEO
| S-EPMC4745987 | biostudies-literature
| S-EPMC8041623 | biostudies-literature
| S-EPMC5946924 | biostudies-literature
| S-EPMC3134529 | biostudies-literature
| S-EPMC9246377 | biostudies-literature
| S-EPMC3198378 | biostudies-literature
| S-EPMC2813482 | biostudies-literature
2011-04-30 | GSE23765 | GEO