Dataset Information

dbVar structural variant cluster set for data analysis and variant comparison.


ABSTRACT: dbVar houses over 3 million submitted structural variants (SSVs) from 120 human studies, including copy number variations (CNVs), insertions, deletions, inversions, translocations, and complex chromosomal rearrangements. Users can submit to dbVar multiple SSVs that are presumably identical but were ascertained by different platforms and samples, which makes it possible to calculate whether a variant is rare or common in the population and allows for cross-validation. However, because SSV genomic location reporting can vary, including fuzzy locations where the start and/or end points are not precisely known, analysis, comparison, annotation, and reporting of SSVs across studies can be difficult. This project was initiated by the Structural Variant Comparison Group to generate a non-redundant set of genomic regions defined by counts of concordance for all human SSVs placed on RefSeq assembly GRCh38 (RefSeq accession GCF_000001405.26). We intend that the availability of these regions, called structural variant clusters (SVCs), will facilitate the analysis, annotation, and exchange of SV data and allow for simplified display in genomic sequence viewers for improved variant interpretation. Sets of SVCs were generated by variant type for each of the 120 studies as well as for a combined set across all studies. Starting from 3.64 million SSVs, 2.5 million and 3.4 million non-redundant SVCs with count >=1 were generated by variant type for each study and across all studies, respectively. In addition, we have developed utilities for annotating, searching, and filtering SVC data in GVF format for computing summary statistics, exporting data for genomic viewers, and annotating the SVCs using external data sources.
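
To illustrate what "regions defined by counts of concordance" could look like in practice, here is a minimal Python sketch. It is not the project's actual pipeline: the SSV record type, the function names, and in particular the matching rule (same chromosome, same variant type, identical start and end) are assumptions made for this example, and the abstract does not state how fuzzy or partially overlapping placements are handled.

```python
from collections import Counter
from typing import Dict, Iterable, NamedTuple, Tuple


class SSV(NamedTuple):
    """Hypothetical minimal record for one submitted structural variant."""
    chrom: str
    start: int
    end: int
    var_type: str
    study: str


# A cluster key is (chromosome, variant type, start, end).
ClusterKey = Tuple[str, str, int, int]


def cluster_ssvs(ssvs: Iterable[SSV]) -> Dict[ClusterKey, int]:
    """Collapse SSVs that share chromosome, type, and exact coordinates.

    Assumption: exact coordinate identity defines concordance here.
    The value for each key is the number of supporting SSVs.
    """
    counts = Counter((v.chrom, v.var_type, v.start, v.end) for v in ssvs)
    return dict(counts)


def filter_by_count(clusters: Dict[ClusterKey, int], min_count: int) -> Dict[ClusterKey, int]:
    """Keep only clusters supported by at least min_count SSVs."""
    return {key: n for key, n in clusters.items() if n >= min_count}


if __name__ == "__main__":
    example = [
        SSV("chr1", 100, 200, "deletion", "study_a"),
        SSV("chr1", 100, 200, "deletion", "study_b"),
        SSV("chr1", 100, 200, "insertion", "study_a"),
        SSV("chr2", 50, 80, "deletion", "study_c"),
    ]
    clusters = cluster_ssvs(example)
    print(filter_by_count(clusters, min_count=2))
```

Keying on variant type keeps a deletion and an insertion at the same coordinates in separate clusters, mirroring the abstract's statement that SVC sets were generated by variant type. Passing only one study's SSVs would give a per-study set, and passing all of them a combined set.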

SUBMITTER: Phan L 

PROVIDER: S-EPMC5345777 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

Publications

dbVar structural variant cluster set for data analysis and variant comparison.

Phan Lon, Hsu Jeffrey, Tri Le Quang Minh, Willi Michaela, Mansour Tamer, Kai Yan, Garner John, Lopez John, Busby Ben

F1000Research, 2016-04-13


Similar Datasets

| S-EPMC4306386 | biostudies-literature
| S-EPMC10006329 | biostudies-literature
| S-EPMC6821325 | biostudies-literature
| S-EPMC1088301 | biostudies-literature
| S-EPMC9793516 | biostudies-literature
| S-EPMC5100683 | biostudies-literature
| S-EPMC5984476 | biostudies-literature
| S-EPMC9825616 | biostudies-literature
| S-EPMC7532444 | biostudies-literature
| S-EPMC3021666 | biostudies-literature