Dataset Information

Combining probabilistic alignments with read pair information improves accuracy of split-alignments.


ABSTRACT: Motivation: Split-alignments provide base-pair-resolution evidence of genomic rearrangements. In practice, they are found by first computing high-scoring local alignments, parts of which are then combined into a split-alignment. This approach is challenging when aligning a short read to a large and repetitive reference, as it tends to produce many spurious local alignments, leading to ambiguities in identifying the correct split-alignment. This problem is further exacerbated by the fact that rearrangements tend to occur in repeat-rich regions. Results: We propose a split-alignment technique that combats the issue of ambiguous alignments by combining information from probabilistic alignment with positional information from paired-end reads. We demonstrate that our method finds accurate split-alignments, and that this translates into improved performance of variant-calling tools that rely on split-alignments. Availability and implementation: An open-source implementation is freely available at: https://bitbucket.org/splitpairedend/last-split-pe. Supplementary information: Supplementary data are available at Bioinformatics online.
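
The general idea of combining alignment probabilities with paired-end positional information can be illustrated with a toy sketch. This is not the authors' algorithm or the last-split-pe implementation. It assumes, purely for illustration, that each candidate alignment comes with a probability, that the mate is placed unambiguously on the same chromosome, and that fragment lengths follow a normal distribution. The function names, parameters, and numbers below are all hypothetical.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def rerank_candidates(candidates, mate_pos, frag_mean, frag_sd):
    """Reweight candidate alignments by an implied fragment-length likelihood.

    candidates: list of (position, alignment_probability) pairs (hypothetical input).
    mate_pos:   position of the mate read, assumed unambiguous and on the
                same chromosome (a simplifying assumption).
    Returns a list of (position, normalized_weight) pairs.
    """
    weights = []
    for pos, align_prob in candidates:
        # Crude fragment length: distance between read and mate positions.
        frag_len = abs(mate_pos - pos)
        weights.append(align_prob * normal_pdf(frag_len, frag_mean, frag_sd))
    total = sum(weights)
    if total == 0.0:
        # No candidate is compatible with the assumed fragment-length model.
        return [(pos, 0.0) for pos, _ in candidates]
    return [(pos, w / total) for (pos, _), w in zip(candidates, weights)]


# Two candidates that are equally likely from the alignment alone
# (for example, two copies of a repeat); only one is near the mate.
candidates = [(10_000, 0.5), (2_500_000, 0.5)]
reweighted = rerank_candidates(candidates, mate_pos=10_300,
                               frag_mean=300.0, frag_sd=30.0)
best_pos, best_weight = max(reweighted, key=lambda t: t[1])
print(best_pos, best_weight)
```

In this toy case the two candidates are indistinguishable by alignment probability alone, and the mate position breaks the tie in favor of the nearby candidate. A real method would also have to handle strand, chromosome, split parts of a read, and uncertainty in the mate's own placement.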

SUBMITTER: Shrestha AMS 

PROVIDER: S-EPMC6198854 | biostudies-other | 2018 Nov

REPOSITORIES: biostudies-other

Publications

Combining probabilistic alignments with read pair information improves accuracy of split-alignments.

Anish M S Shrestha, Naruki Yoshikawa, Kiyoshi Asai

Bioinformatics (Oxford, England), 2018 Nov 1, issue 21


Similar Datasets

| S-EPMC10925290 | biostudies-literature
| S-EPMC3821552 | biostudies-literature
| S-EPMC6789115 | biostudies-literature
| S-EPMC2335322 | biostudies-literature
| S-EPMC7814537 | biostudies-literature
| S-EPMC7727374 | biostudies-literature
| S-EPMC5767225 | biostudies-literature
| S-EPMC6157255 | biostudies-literature
| S-EPMC1584415 | biostudies-literature
| S-EPMC6912988 | biostudies-literature