Unknown

Dataset Information

0

M2SG: mapping human disease-related genetic variants to protein sequences and genomic loci.


ABSTRACT: Online Mendelian Inheritance in Man (OMIM) is a manually curated compendium of human genetic variants and the corresponding phenotypes, mostly human diseases. Instead of directly documenting the native sequences for gene entries, OMIM links its entries to protein and DNA sequences in other databases. However, because of the existence of gene isoforms and errors in OMIM records, mapping a specific OMIM mutation to its corresponding protein sequence is not trivial. Combining computer programs and extensive manual curation of OMIM full-text descriptions and original literature, we mapped 98% of OMIM amino acid substitutions (AASs) and all SwissProt Variant (SwissVar) disease-related AASs to reference sequences and confidently mapped 99.96% of all AASs to the genomic loci. Based on the results, we developed an online database and interactive web server (M2SG) to (i) retrieve the mapped OMIM and SwissVar variants for a given protein sequence; and (ii) obtain related proteins and mutations for an input disease phenotype. This database will be useful for analyzing sequences, understanding the effect of mutations, identifying important genetic variations and designing experiments on a protein of interest.The database and web server are freely available at http://prodata.swmed.edu/M2S/mut2seq.cgi.

SUBMITTER: Ji R 

PROVIDER: S-EPMC3810852 | biostudies-other | 2013 Nov

REPOSITORIES: biostudies-other

altmetric image

Publications

M2SG: mapping human disease-related genetic variants to protein sequences and genomic loci.

Ji Renkai R   Cong Qian Q   Li Wenlin W   Grishin Nick V NV  

Bioinformatics (Oxford, England) 20130903 22


<h4>Summary</h4>Online Mendelian Inheritance in Man (OMIM) is a manually curated compendium of human genetic variants and the corresponding phenotypes, mostly human diseases. Instead of directly documenting the native sequences for gene entries, OMIM links its entries to protein and DNA sequences in other databases. However, because of the existence of gene isoforms and errors in OMIM records, mapping a specific OMIM mutation to its corresponding protein sequence is not trivial. Combining comput  ...[more]

Similar Datasets

| S-EPMC1196395 | biostudies-literature
| S-EPMC9601451 | biostudies-literature
| S-EPMC10521080 | biostudies-literature
| S-EPMC8891017 | biostudies-literature
| S-EPMC9316745 | biostudies-literature
2016-07-06 | E-GEOD-72696 | biostudies-arrayexpress
2016-07-06 | GSE72696 | GEO
| S-EPMC4314773 | biostudies-literature
| S-EPMC4666734 | biostudies-literature
| S-EPMC3606822 | biostudies-literature