Unknown

Dataset Information

0

MetaSV: an accurate and integrative structural-variant caller for next generation sequencing.


ABSTRACT:

Unlabelled

Structural variations (SVs) are large genomic rearrangements that vary significantly in size, making them challenging to detect with the relatively short reads from next-generation sequencing (NGS). Different SV detection methods have been developed; however, each is limited to specific kinds of SVs with varying accuracy and resolution. Previous works have attempted to combine different methods, but they still suffer from poor accuracy particularly for insertions. We propose MetaSV, an integrated SV caller which leverages multiple orthogonal SV signals for high accuracy and resolution. MetaSV proceeds by merging SVs from multiple tools for all types of SVs. It also analyzes soft-clipped reads from alignment to detect insertions accurately since existing tools underestimate insertion SVs. Local assembly in combination with dynamic programming is used to improve breakpoint resolution. Paired-end and coverage information is used to predict SV genotypes. Using simulation and experimental data, we demonstrate the effectiveness of MetaSV across various SV types and sizes.

Availability and implementation

Code in Python is at http://bioinform.github.io/metasv/.

Contact

rd@bina.com

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Mohiyuddin M 

PROVIDER: S-EPMC4528635 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

MetaSV: an accurate and integrative structural-variant caller for next generation sequencing.

Mohiyuddin Marghoob M   Mu John C JC   Li Jian J   Bani Asadi Narges N   Gerstein Mark B MB   Abyzov Alexej A   Wong Wing H WH   Lam Hugo Y K HY  

Bioinformatics (Oxford, England) 20150410 16


<h4>Unlabelled</h4>Structural variations (SVs) are large genomic rearrangements that vary significantly in size, making them challenging to detect with the relatively short reads from next-generation sequencing (NGS). Different SV detection methods have been developed; however, each is limited to specific kinds of SVs with varying accuracy and resolution. Previous works have attempted to combine different methods, but they still suffer from poor accuracy particularly for insertions. We propose M  ...[more]

Similar Datasets

| S-EPMC6499249 | biostudies-literature
| S-EPMC4550471 | biostudies-literature
| S-EPMC7182099 | biostudies-literature
| S-EPMC4914105 | biostudies-literature
| S-EPMC3292476 | biostudies-literature
| S-EPMC2978646 | biostudies-literature
| S-EPMC6477992 | biostudies-literature
| S-EPMC3534403 | biostudies-literature
| S-EPMC3410788 | biostudies-other
2017-04-03 | PXD003804 | Pride