Unknown

Dataset Information

0

UniProt genomic mapping for deciphering functional effects of missense variants.


ABSTRACT: Understanding the association of genetic variation with its functional consequences in proteins is essential for the interpretation of genomic data and identifying causal variants in diseases. Integration of protein function knowledge with genome annotation can assist in rapidly comprehending genetic variation within complex biological processes. Here, we describe mapping UniProtKB human sequences and positional annotations, such as active sites, binding sites, and variants to the human genome (GRCh38) and the release of a public genome track hub for genome browsers. To demonstrate the power of combining protein annotations with genome annotations for functional interpretation of variants, we present specific biological examples in disease-related genes and proteins. Computational comparisons of UniProtKB annotations and protein variants with ClinVar clinically annotated single nucleotide polymorphism (SNP) data show that 32% of UniProtKB variants colocate with 8% of ClinVar SNPs. The majority of colocated UniProtKB disease-associated variants (86%) map to 'pathogenic' ClinVar SNPs. UniProt and ClinVar are collaborating to provide a unified clinical variant annotation for genomic, protein, and clinical researchers. The genome track hubs, and related UniProtKB files, are downloadable from the UniProt FTP site and discoverable as public track hubs at the UCSC and Ensembl genome browsers.

SUBMITTER: McGarvey PB 

PROVIDER: S-EPMC6563471 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

UniProt genomic mapping for deciphering functional effects of missense variants.

McGarvey Peter B PB   Nightingale Andrew A   Luo Jie J   Huang Hongzhan H   Martin Maria J MJ   Wu Cathy C  

Human mutation 20190403 6


Understanding the association of genetic variation with its functional consequences in proteins is essential for the interpretation of genomic data and identifying causal variants in diseases. Integration of protein function knowledge with genome annotation can assist in rapidly comprehending genetic variation within complex biological processes. Here, we describe mapping UniProtKB human sequences and positional annotations, such as active sites, binding sites, and variants to the human genome (  ...[more]

Similar Datasets

| S-EPMC5740498 | biostudies-literature
| S-EPMC7359949 | biostudies-literature
| S-EPMC10838372 | biostudies-literature
| S-EPMC9262418 | biostudies-literature
| PRJEB46587 | ENA
| S-EPMC9250003 | biostudies-literature
| S-EPMC2367626 | biostudies-literature
| S-EPMC10034637 | biostudies-literature
2020-12-23 | GSE162130 | GEO
| S-EPMC2567350 | biostudies-literature