
Dataset Information


Sequence, Structure, and Functional Space of Drosophila De Novo Proteins.


ABSTRACT: During de novo emergence, new protein-coding genes arise from previously nongenic sequences. The de novo proteins they encode differ from conserved proteins in composition and predicted biochemical properties. Nevertheless, functional de novo proteins do exist. Both the identification of functional de novo proteins and their structural characterization are experimentally laborious. To identify functional and structured de novo proteins in silico, we applied recently developed machine-learning-based tools and found that most de novo proteins indeed differ from conserved proteins in both structure and sequence. However, some de novo proteins are predicted to adopt known protein folds, participate in cellular reactions, and form biomolecular condensates. Apart from broadening our understanding of de novo protein evolution, our study also provides a large set of testable hypotheses for focused experimental studies on the structure and function of de novo proteins in Drosophila.

SUBMITTER: Middendorf L 

PROVIDER: S-EPMC11363682 | biostudies-literature | 2024 Aug

REPOSITORIES: biostudies-literature


Publications


Middendorf L, Ravi Iyengar B, Eicholt LA

Genome Biology and Evolution, 1 Aug 2024, issue 8



Similar Datasets

| S-EPMC3956894 | biostudies-literature
| S-EPMC6288389 | biostudies-literature
| S-EPMC2865713 | biostudies-literature
| S-EPMC7954818 | biostudies-literature
| S-EPMC10089919 | biostudies-literature
| S-EPMC6733475 | biostudies-literature
| S-EPMC8114120 | biostudies-literature
| S-EPMC10138783 | biostudies-literature
| S-EPMC6030986 | biostudies-literature
| S-EPMC2753099 | biostudies-literature