Unknown

Dataset Information

0

Bivartect: accurate and memory-saving breakpoint detection by direct read comparison.


ABSTRACT: MOTIVATION:Genetic variant calling with high-throughput sequencing data has been recognized as a useful tool for better understanding of disease mechanism and detection of potential off-target sites in genome editing. Since most of the variant calling algorithms rely on initial mapping onto a reference genome and tend to predict many variant candidates, variant calling remains challenging in terms of predicting variants with low false positives. RESULTS:Here we present Bivartect, a simple yet versatile variant caller based on direct comparison of short sequence reads between normal and mutated samples. Bivartect can detect not only single nucleotide variants but also insertions/deletions, inversions and their complexes. Bivartect achieves high predictive performance with an elaborate memory-saving mechanism, which allows Bivartect to run on a computer with a single node for analyzing small omics data. Tests with simulated benchmark and real genome-editing data indicate that Bivartect was comparable to state-of-the-art variant callers in positive predictive value for detection of single nucleotide variants, even though it yielded a substantially small number of candidates. These results suggest that Bivartect, a reference-free approach, will contribute to the identification of germline mutations as well as off-target sites introduced during genome editing with high accuracy. AVAILABILITY AND IMPLEMENTATION:Bivartect is implemented in C++ and available along with in silico simulated data at https://github.com/ykat0/bivartect. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Shimmura K 

PROVIDER: S-EPMC7203739 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bivartect: accurate and memory-saving breakpoint detection by direct read comparison.

Shimmura Keisuke K   Kato Yuki Y   Kawahara Yukio Y  

Bioinformatics (Oxford, England) 20200501 9


<h4>Motivation</h4>Genetic variant calling with high-throughput sequencing data has been recognized as a useful tool for better understanding of disease mechanism and detection of potential off-target sites in genome editing. Since most of the variant calling algorithms rely on initial mapping onto a reference genome and tend to predict many variant candidates, variant calling remains challenging in terms of predicting variants with low false positives.<h4>Results</h4>Here we present Bivartect,  ...[more]

Similar Datasets

| S-EPMC6044046 | biostudies-literature
| S-EPMC2752127 | biostudies-literature
| S-EPMC11246426 | biostudies-literature
| S-EPMC4253826 | biostudies-other
| S-EPMC6776680 | biostudies-literature
| S-EPMC3436849 | biostudies-literature
| S-EPMC7057128 | biostudies-literature
| S-EPMC3106329 | biostudies-literature
| S-EPMC5657622 | biostudies-literature
| S-EPMC7004874 | biostudies-literature