
Dataset Information


SVachra: a tool to identify genomic structural variation in mate pair sequencing data containing inward and outward facing reads.


ABSTRACT: Characterization of genomic structural variation (SV) is essential to expanding the research and clinical applications of genome sequencing. Reliance upon short DNA fragment paired end sequencing has yielded a wealth of single nucleotide variants and within-read insertions-deletions, at the cost of limited SV detection. Multi-kilobase DNA fragment mate pair sequencing has helped fill the gap in SV detection, but has introduced new analytic challenges that require SV detection tools designed specifically for mate pair sequencing data. Here, we introduce SVachra (Structural Variation Assessment of CHRomosomal Aberrations), a breakpoint calling program that identifies large insertions-deletions, inversions, and inter- and intra-chromosomal translocations using both the inward and outward facing read types generated by mate pair sequencing. We demonstrate SVachra's utility by executing the program on large-insert (Illumina Nextera) mate pair sequencing data from the personal genome of a single subject (HS1011). An additional long-read (Pacific Biosciences RSII) data set was generated to validate SV calls from SVachra and from other SV calling programs used for comparison. SVachra exhibited the highest validation rate and reported the widest distribution of SV types and size ranges when compared to other SV callers. SVachra is a highly specific breakpoint calling program that exhibits a less biased SV detection methodology than other callers.

SUBMITTER: Hampton OA 

PROVIDER: S-EPMC5629590 | biostudies-literature | 2017 Oct

REPOSITORIES: biostudies-literature


Publications

SVachra: a tool to identify genomic structural variation in mate pair sequencing data containing inward and outward facing reads.

Oliver A. Hampton, Adam C. English, Mark Wang, William J. Salerno, Yue Liu, Donna M. Muzny, Yi Han, David A. Wheeler, Kim C. Worley, James R. Lupski, Richard A. Gibbs

BMC Genomics, 2017-10-03, Suppl 6



Similar Datasets

| S-EPMC6589766 | biostudies-literature
| S-EPMC7428307 | biostudies-literature
| S-EPMC6325147 | biostudies-other
| S-EPMC5519378 | biostudies-literature
| S-EPMC3390859 | biostudies-other
| S-EPMC4816688 | biostudies-literature
| S-EPMC4633820 | biostudies-literature
| S-EPMC2905550 | biostudies-other
| S-EPMC6986796 | biostudies-literature
| PRJEB4453 | ENA