Proteomics

Dataset Information


Nematode gene annotation by machine learning assisted proteotranscriptomics enables proteome-wide evolutionary analysis


ABSTRACT: Nematodes encompass over 24,000 described species, which have been found in almost every ecological habitat and make up over 80% of metazoan taxonomic diversity in soils. The last common ancestor of nematodes is believed to date back around 650–750 million years, giving rise to a large and phylogenetically diverse group to be explored. However, for most species, high-quality gene annotations are incomplete or missing. Combining short-read RNA sequencing with mass spectrometry-based proteomics and machine learning quality control in an approach called proteotranscriptomics, we improve gene annotations for 9 genome-sequenced nematode species and provide new gene annotations for 3 additional species without genome assemblies. Emphasizing the sensitivity of our methodology, we provide evidence for two hitherto undescribed genes in the model organism Caenorhabditis elegans. Extensive phylogenetic systems analysis using this comprehensive proteome annotation provides new insights into the evolutionary processes of this metazoan group.

INSTRUMENT(S): Q Exactive

ORGANISM(S): Caenorhabditis elegans

SUBMITTER: F Butter  

LAB HEAD: Falk Butter

PROVIDER: PXD034107 | Pride | 2022-11-12

REPOSITORIES: Pride

Similar Datasets

2015-04-23 | E-GEOD-60358 | biostudies-arrayexpress
2016-06-29 | PXD004276 | Pride
2016-06-20 | PXD003304 | Pride
2017-11-08 | PXD006489 | Pride
2021-01-20 | PXD020632 | Pride
2011-08-19 | E-GEOD-29779 | biostudies-arrayexpress
2016-07-15 | E-GEOD-84456 | biostudies-arrayexpress
2021-06-28 | PXD020870 | Pride
2013-02-26 | E-GEOD-44615 | biostudies-arrayexpress
2022-08-12 | PXD023862 | Pride