Unknown

Dataset Information

0

SoftSearch: integration of multiple sequence features to identify breakpoints of structural variations.


ABSTRACT: BACKGROUND: Structural variation (SV) represents a significant, yet poorly understood contribution to an individual's genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearch combines multiple strategies for detecting SV including split-read, discordant read-pair, and unmated pairs. Co-localized split-reads and discordant read pairs are used to refine the breakpoints. RESULTS: We developed and validated SoftSearch using real and synthetic datasets. SoftSearch's key features are 1) not requiring secondary (or exhaustive primary) alignment, 2) portability into established sequencing workflows, and 3) is applicable to any DNA-sequencing experiment (e.g. whole genome, exome, custom capture, etc.). SoftSearch identifies breakpoints from a small number of soft-clipped bases from split reads and a few discordant read-pairs which on their own would not be sufficient to make an SV call. CONCLUSIONS: We show that SoftSearch can identify more true SVs by combining multiple sequence features. SoftSearch was able to call clinically relevant SVs in the BRCA2 gene not reported by other tools while offering significantly improved overall performance.

SUBMITTER: Hart SN 

PROVIDER: S-EPMC3865185 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

SoftSearch: integration of multiple sequence features to identify breakpoints of structural variations.

Hart Steven N SN   Sarangi Vivekananda V   Moore Raymond R   Baheti Saurabh S   Bhavsar Jaysheel D JD   Couch Fergus J FJ   Kocher Jean-Pierre A JP  

PloS one 20131216 12


<h4>Background</h4>Structural variation (SV) represents a significant, yet poorly understood contribution to an individual's genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearc  ...[more]

Similar Datasets

| S-EPMC9845341 | biostudies-literature
2022-03-07 | PXD031606 | Pride
| S-EPMC2692051 | biostudies-literature
| S-EPMC8927474 | biostudies-literature
| S-EPMC3926948 | biostudies-literature
| S-EPMC6114881 | biostudies-literature
| S-EPMC7038560 | biostudies-literature
| S-EPMC10099770 | biostudies-literature
| S-EPMC7291640 | biostudies-literature
| S-EPMC9971630 | biostudies-literature