Unknown

Dataset Information

0

ClinVar data parsing.


ABSTRACT: This software repository provides a pipeline for converting raw ClinVar data files into analysis-friendly tab-delimited tables, and also provides these tables for the most recent ClinVar release. Separate tables are generated for genome builds GRCh37 and GRCh38 as well as for mono-allelic variants and complex multi-allelic variants. Additionally, the tables are augmented with allele frequencies from the ExAC and gnomAD datasets as these are often consulted when analyzing ClinVar variants. Overall, this work provides ClinVar data in a format that is easier to work with and can be directly loaded into a variety of popular analysis tools such as R, python pandas, and SQL databases.

SUBMITTER: Zhang X 

PROVIDER: S-EPMC5473414 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications


This software repository provides a pipeline for converting raw ClinVar data files into analysis-friendly tab-delimited tables, and also provides these tables for the most recent ClinVar release. Separate tables are generated for genome builds GRCh37 and GRCh38 as well as for mono-allelic variants and complex multi-allelic variants. Additionally, the tables are augmented with allele frequencies from the ExAC and gnomAD datasets as these are often consulted when analyzing ClinVar variants. Overal  ...[more]

Similar Datasets

| S-EPMC10715767 | biostudies-literature
| S-EPMC2956011 | biostudies-literature
| S-EPMC10879749 | biostudies-literature
| S-EPMC2734164 | biostudies-literature
| S-EPMC7751177 | biostudies-literature
| S-EPMC8783432 | biostudies-literature
2018-09-18 | PXD011070 | Pride
2018-09-18 | PXD011069 | Pride
| S-EPMC6206854 | biostudies-literature
| S-EPMC4832236 | biostudies-literature