Unknown

Dataset Information

0

DCMP: database of cancer mutant protein domains.


ABSTRACT: Protein domains are functional and structural units of proteins. They are responsible for a particular function that contributes to protein's overall role. Because of this essential role, the majority of the genetic variants occur in the domains. In this study, the somatic mutations across 21 cancer types were mapped to the individual protein domains. To map the mutations to the domains, we employed the whole human proteome to predict the domains in each protein sequence and recognized about 149 668 domains. A novel Perl-API program was developed to convert the protein domain positions into genomic positions, and users can freely access them through GitHub. We determined the distribution of protein domains across 23 chromosomes with the help of these genomic positions. Interestingly, chromosome 19 has more number of protein domains in comparison with other chromosomes. Then, we mapped the cancer mutations to all the protein domains. Around 46-65% of mutations were mapped to their corresponding protein domains, and significantly mutated domains for all the cancer types were determined using the local false discovery ratio (locfdr). The chromosome positions for all the protein domains can be verified using the cross-reference ensemble database. Database URL: https://dcmp.vit.ac.in/.

SUBMITTER: Emerson IA 

PROVIDER: S-EPMC8607521 | biostudies-literature | 2021 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

DCMP: database of cancer mutant protein domains.

Emerson Isaac Arnold IA   Chitluri Kiran Kumar KK  

Database : the journal of biological databases and curation 20211101 2021


Protein domains are functional and structural units of proteins. They are responsible for a particular function that contributes to protein's overall role. Because of this essential role, the majority of the genetic variants occur in the domains. In this study, the somatic mutations across 21 cancer types were mapped to the individual protein domains. To map the mutations to the domains, we employed the whole human proteome to predict the domains in each protein sequence and recognized about 149  ...[more]

Similar Datasets

| S-EPMC6025185 | biostudies-literature
| S-EPMC7778928 | biostudies-literature
| S-EPMC403726 | biostudies-literature
| S-EPMC6260753 | biostudies-literature
| S-EPMC7157182 | biostudies-literature
| S-EPMC6841964 | biostudies-literature
| S-EPMC1347450 | biostudies-literature
| S-EPMC29841 | biostudies-literature
2018-09-06 | PRJEB28526 | EVA
| S-EPMC2779824 | biostudies-literature