Unknown

Dataset Information

0

FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods.


ABSTRACT: Comprehensive and accurate identification of structural variations (SVs) from next generation sequencing data remains a major challenge. We develop FusorSV, which uses a data mining approach to assess performance and merge callsets from an ensemble of SV-calling algorithms. It includes a fusion model built using analysis of 27 deep-coverage human genomes from the 1000 Genomes Project. We identify 843 novel SV calls that were not reported by the 1000 Genomes Project for these 27 samples. Experimental validation of a subset of these calls yields a validation rate of 86.7%. FusorSV is available at https://github.com/TheJacksonLaboratory/SVE .

SUBMITTER: Becker T 

PROVIDER: S-EPMC5859555 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5096953 | biostudies-literature
| S-EPMC3617433 | biostudies-other
| S-EPMC1914088 | biostudies-literature
| S-EPMC4832315 | biostudies-literature
| S-EPMC3046488 | biostudies-literature
| S-EPMC4330340 | biostudies-literature
| S-EPMC3910607 | biostudies-literature
| S-EPMC2758278 | biostudies-literature
| S-EPMC7589535 | biostudies-literature
| S-EPMC6468241 | biostudies-literature