Unknown

Dataset Information

0

BatAlign: an incremental method for accurate alignment of sequencing reads.


ABSTRACT: Structural variations (SVs) play a crucial role in genetic diversity. However, the alignments of reads near/across SVs are made inaccurate by the presence of polymorphisms. BatAlign is an algorithm that integrated two strategies called 'Reverse-Alignment' and 'Deep-Scan' to improve the accuracy of read-alignment. In our experiments, BatAlign was able to obtain the highest F-measures in read-alignments on mismatch-aberrant, indel-aberrant, concordantly/discordantly paired and SV-spanning data sets. On real data, the alignments of BatAlign were able to recover 4.3% more PCR-validated SVs with 73.3% less callings. These suggest BatAlign to be effective in detecting SVs and other polymorphic-variants accurately using high-throughput data. BatAlign is publicly available at https://goo.gl/a6phxB.

SUBMITTER: Lim JQ 

PROVIDER: S-EPMC4652746 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

BatAlign: an incremental method for accurate alignment of sequencing reads.

Lim Jing-Quan JQ   Tennakoon Chandana C   Guan Peiyong P   Sung Wing-Kin WK  

Nucleic acids research 20150713 16


Structural variations (SVs) play a crucial role in genetic diversity. However, the alignments of reads near/across SVs are made inaccurate by the presence of polymorphisms. BatAlign is an algorithm that integrated two strategies called 'Reverse-Alignment' and 'Deep-Scan' to improve the accuracy of read-alignment. In our experiments, BatAlign was able to obtain the highest F-measures in read-alignments on mismatch-aberrant, indel-aberrant, concordantly/discordantly paired and SV-spanning data set  ...[more]

Similar Datasets

| S-EPMC8256862 | biostudies-literature
| S-EPMC5451431 | biostudies-literature
2016-08-14 | E-GEOD-84170 | biostudies-arrayexpress
| S-EPMC5029459 | biostudies-literature
| S-EPMC10584998 | biostudies-literature
2016-08-14 | GSE84170 | GEO
| S-EPMC10423031 | biostudies-literature
| S-EPMC3118166 | biostudies-literature
| S-EPMC8179091 | biostudies-literature
| S-EPMC4970289 | biostudies-literature