Unknown

Dataset Information

0

ABRA: improved coding indel detection via assembly-based realignment.


ABSTRACT: Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germ-line genotypes. Existing NGS read mappers have difficulty accurately mapping short reads containing complex variation (i.e. more than a single base change), thus making identification of such variants difficult or impossible. Insertions and deletions (indels) in particular have been an area of great difficulty. Indels are frequent and can have substantial impact on function, which makes their detection all the more imperative.We present ABRA, an assembly-based realigner, which uses an efficient and flexible localized de novo assembly followed by global realignment to more accurately remap reads. This results in enhanced performance for indel detection as well as improved accuracy in variant allele frequency estimation.ABRA is implemented in a combination of Java and C/C++ and is freely available for download at https://github.com/mozack/abra.

SUBMITTER: Mose LE 

PROVIDER: S-EPMC4173014 | biostudies-literature | 2014 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

ABRA: improved coding indel detection via assembly-based realignment.

Mose Lisle E LE   Wilkerson Matthew D MD   Hayes D Neil DN   Perou Charles M CM   Parker Joel S JS  

Bioinformatics (Oxford, England) 20140606 19


<h4>Motivation</h4>Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germ-line genotypes. Existing NGS read mappers have difficulty accurately mapping short reads containing complex variation (i.e. more than a single base change), thus making identification of such variants difficult or i  ...[more]

Similar Datasets

| S-EPMC6735753 | biostudies-literature
| S-EPMC4671222 | biostudies-literature
2022-05-11 | GSE196416 | GEO
| PRJNA804865 | ENA
| S-EPMC4538813 | biostudies-literature
| PRJNA1027083 | ENA
| PRJEB5466 | ENA
| S-EPMC3557041 | biostudies-literature
| S-EPMC8193930 | biostudies-literature
| S-EPMC4270401 | biostudies-literature