Dataset Information

ProGMap: an integrated annotation resource for protein orthology.


ABSTRACT: Current protein sequence databases employ different classification schemes that often provide conflicting annotations, especially for poorly characterized proteins. ProGMap (Protein Group Mappings, http://www.bioinformatics.nl/progmap) is a web tool designed to help researchers and database annotators assess the coherence of protein groups defined in various databases and thereby facilitate the annotation of newly sequenced proteins. ProGMap is based on a non-redundant dataset of over 6.6 million protein sequences that is mapped to 240,000 protein group descriptions collected from UniProt, RefSeq, Ensembl, COG, KOG, OrthoMCL-DB, HomoloGene, TRIBES and PIRSF. ProGMap combines the underlying classification schemes via a network of links constructed by a fast and fully automated mapping approach originally developed for document classification. The web interface enables queries to be made using sequence identifiers, gene symbols, protein functions, or amino acid and nucleotide sequences. For the latter query type, BLAST similarity search and QuickMatch identity search services have been incorporated for finding sequences similar (or identical) to a query sequence. ProGMap is meant to help users of high-throughput methodologies who deal with partially annotated genomic data.

SUBMITTER: Kuzniar A 

PROVIDER: S-EPMC2703891 | biostudies-literature | 2009 Jul

REPOSITORIES: biostudies-literature

Publications

ProGMap: an integrated annotation resource for protein orthology.

Kuzniar A, Lin K, He Y, Nijveen H, Pongor S, Leunissen JAM

Nucleic Acids Research, 2009-06-03, Web Server issue


Similar Datasets

| S-EPMC10013926 | biostudies-literature
| S-EPMC10193443 | biostudies-literature
| S-EPMC2686469 | biostudies-literature
| S-EPMC5850834 | biostudies-literature
| S-EPMC4702792 | biostudies-literature
| S-EPMC3083348 | biostudies-literature
| S-EPMC1347458 | biostudies-literature
| S-EPMC8662613 | biostudies-literature
| S-EPMC9623898 | biostudies-literature
| S-EPMC3966033 | biostudies-literature