Dataset Information

Denoising of Aligned Genomic Data.


ABSTRACT: Noise in genomic sequencing data is known to affect various stages of genomic data analysis pipelines. Variant identification is an important step in many of these pipelines and is increasingly used in clinical settings to aid medical practice. We propose a denoising method, dubbed SAMDUDE, which operates on aligned genomic data in order to improve variant calling performance. Denoising human data with SAMDUDE improved variant identification in both individual chromosome and whole genome sequencing (WGS) data sets. In the WGS data set, denoising led to the identification of almost 2,000 additional true variants and the elimination of over 1,500 erroneously identified variants. In contrast, we found that denoising with other state-of-the-art denoisers significantly worsens variant calling performance. SAMDUDE is written in Python and is freely available at https://github.com/ihwang/SAMDUDE.

SUBMITTER: Fischer-Hwang I 

PROVIDER: S-EPMC6803637 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

Publications

Denoising of Aligned Genomic Data.

Fischer-Hwang Irena, Ochoa Idoia, Weissman Tsachy, Hernaez Mikel

Scientific Reports, 2019 Oct 21, issue 1


Similar Datasets

| S-EPMC5666573 | biostudies-literature
| S-EPMC3563472 | biostudies-literature
| S-EPMC3110192 | biostudies-literature
| S-EPMC10092037 | biostudies-literature
| S-EPMC4881387 | biostudies-literature
| S-EPMC7781045 | biostudies-literature
| S-EPMC3607570 | biostudies-literature
| S-EPMC8762370 | biostudies-literature
| S-EPMC9477507 | biostudies-literature
| S-EPMC8733986 | biostudies-literature