Unknown

Dataset Information

0

NGSEP3: accurate variant calling across species and sequencing protocols.


ABSTRACT:

Motivation

Accurate detection, genotyping and downstream analysis of genomic variants from high-throughput sequencing data are fundamental features in modern production pipelines for genetic-based diagnosis in medicine or genomic selection in plant and animal breeding. Our research group maintains the Next-Generation Sequencing Experience Platform (NGSEP) as a precise, efficient and easy-to-use software solution for these features.

Results

Understanding that incorrect alignments around short tandem repeats are an important source of genotyping errors, we implemented in NGSEP new algorithms for realignment and haplotype clustering of reads spanning indels and short tandem repeats. We performed extensive benchmark experiments comparing NGSEP to state-of-the-art software using real data from three sequencing protocols and four species with different distributions of repetitive elements. NGSEP consistently shows comparative accuracy and better efficiency compared to the existing solutions. We expect that this work will contribute to the continuous improvement of quality in variant calling needed for modern applications in medicine and agriculture.

Availability and implementation

NGSEP is available as open source software at http://ngsep.sf.net.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Tello D 

PROVIDER: S-EPMC6853766 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

NGSEP3: accurate variant calling across species and sequencing protocols.

Tello Daniel D   Gil Juanita J   Loaiza Cristian D CD   Riascos John J JJ   Cardozo Nicolás N   Duitama Jorge J  

Bioinformatics (Oxford, England) 20191101 22


<h4>Motivation</h4>Accurate detection, genotyping and downstream analysis of genomic variants from high-throughput sequencing data are fundamental features in modern production pipelines for genetic-based diagnosis in medicine or genomic selection in plant and animal breeding. Our research group maintains the Next-Generation Sequencing Experience Platform (NGSEP) as a precise, efficient and easy-to-use software solution for these features.<h4>Results</h4>Understanding that incorrect alignments a  ...[more]

Similar Datasets

| S-EPMC8602313 | biostudies-literature
| S-EPMC7751401 | biostudies-literature
| S-EPMC6788989 | biostudies-literature
| S-EPMC10777354 | biostudies-literature
| S-EPMC10311303 | biostudies-literature
| S-EPMC6341484 | biostudies-literature
| S-EPMC9900919 | biostudies-literature
| S-EPMC11322167 | biostudies-literature
| S-EPMC11246426 | biostudies-literature
| S-EPMC7576216 | biostudies-literature