Dataset Information

A practical method to detect SNVs and indels from whole genome and exome sequencing data.


ABSTRACT: The recent development of massively parallel sequencing technology has allowed the creation of comprehensive catalogs of genetic variation. However, due to the relatively high sequencing error rate for short read sequence data, sophisticated analysis methods are required to obtain high-quality variant calls. Here, we developed a probabilistic multinomial method for the detection of single nucleotide variants (SNVs) as well as short insertions and deletions (indels) in whole genome sequencing (WGS) and whole exome sequencing (WES) data for single sample calling. Evaluation with DNA genotyping arrays revealed a concordance rate of 99.98% for WGS calls and 99.99% for WES calls. Sanger sequencing of the discordant calls determined the false positive and false negative rates for the WGS (0.0068% and 0.17%) and WES (0.0036% and 0.0084%) datasets. Furthermore, short indels were identified with high accuracy (WGS: 94.7%, WES: 97.3%). We believe our method can contribute to the greater understanding of human diseases.
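The abstract names a probabilistic multinomial method but does not give its details. As a rough illustration of what a multinomial genotype call at a single site can look like, the sketch below scores each diploid genotype by the multinomial likelihood of the observed base counts and picks the best one. It is not the authors' implementation: the function names, the uniform per-base error model, the flat prior over genotypes, and the `error_rate` default are all assumptions made for this example.

```python
import math

BASES = "ACGT"


def genotype_log_likelihood(counts, genotype, error_rate=0.01):
    """Log multinomial likelihood of base counts given a diploid genotype.

    counts     -- dict mapping base -> number of reads showing that base
    genotype   -- two-character string such as "AA" or "AG"
    error_rate -- assumed per-base sequencing error rate (hypothetical default)
    """
    # Probability of reading each base: each allele is sampled with
    # probability 0.5, and an error turns it into one of the 3 other bases.
    probs = {}
    for base in BASES:
        p = 0.0
        for allele in genotype:
            p += 0.5 * ((1 - error_rate) if base == allele else error_rate / 3)
        probs[base] = p

    n = sum(counts.get(b, 0) for b in BASES)
    # Log of the multinomial coefficient n! / (k_A! k_C! k_G! k_T!)
    log_coef = math.lgamma(n + 1) - sum(
        math.lgamma(counts.get(b, 0) + 1) for b in BASES
    )
    return log_coef + sum(counts.get(b, 0) * math.log(probs[b]) for b in BASES)


def call_genotype(counts, error_rate=0.01):
    """Return the maximum-likelihood genotype (flat prior over 10 genotypes)."""
    genotypes = [a + b for i, a in enumerate(BASES) for b in BASES[i:]]
    return max(
        genotypes,
        key=lambda g: genotype_log_likelihood(counts, g, error_rate),
    )
```

For example, `call_genotype({"A": 15, "G": 14, "C": 1})` favors the heterozygous genotype `"AG"`, because a lone `C` read is far more plausible as a sequencing error than 14 `G` reads would be under `"AA"`. A production caller would also weigh base and mapping qualities and apply genotype priors, which this sketch leaves out.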

SUBMITTER: Shigemizu D 

PROVIDER: S-EPMC3703611 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

A practical method to detect SNVs and indels from whole genome and exome sequencing data.

Daichi Shigemizu, Akihiro Fujimoto, Shintaro Akiyama, Tetsuo Abe, Kaoru Nakano, Keith A. Boroevich, Yujiro Yamamoto, Mayuko Furuta, Michiaki Kubo, Hidewaki Nakagawa, Tatsuhiko Tsunoda

Scientific Reports, 2013


Similar Datasets

| S-EPMC5549930 | biostudies-other
| S-EPMC4253833 | biostudies-other
| S-EPMC4146529 | biostudies-literature
| S-EPMC8155357 | biostudies-literature
| S-EPMC5865163 | biostudies-other
| S-EPMC8686574 | biostudies-literature
| S-EPMC5431795 | biostudies-literature
| S-EPMC4375422 | biostudies-literature
| S-EPMC4240813 | biostudies-literature