Unknown

Dataset Information

0

Evolutionary characteristics of intergenic transcribed regions indicate rare novel genes and widespread noisy transcription in the Poaceae.


ABSTRACT: Extensive transcriptional activity occurring in intergenic regions of genomes has raised the question whether intergenic transcription represents the activity of novel genes or noisy expression. To address this, we evaluated cross-species and post-duplication sequence and expression conservation of intergenic transcribed regions (ITRs) in four Poaceae species. Among 43,301 ITRs across the four species, 34,460 (80%) are species-specific. ITRs found across species tend to be more divergent in expression and have more recent duplicates compared to annotated genes. To assess if ITRs are functional (under selection), machine learning models were established in Oryza sativa (rice) that could accurately distinguish between phenotype genes and pseudogenes (area under curve-receiver operating characteristic?=?0.94). Based on the models, 584 (8%) and 4391 (61%) rice ITRs are classified as likely functional and nonfunctional with high confidence, respectively. ITRs with conserved expression and ancient retained duplicates, features that were not part of the model, are frequently classified as likely-functional, suggesting these characteristics could serve as pragmatic rules of thumb for identifying candidate sequences likely to be under selection. This study also provides a framework to identify novel genes using comparative transcriptomic data to improve genome annotation that is fundamental for connecting genotype to phenotype in crop and model systems.

SUBMITTER: Lloyd JP 

PROVIDER: S-EPMC6702216 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evolutionary characteristics of intergenic transcribed regions indicate rare novel genes and widespread noisy transcription in the Poaceae.

Lloyd John P JP   Bowman Megan J MJ   Azodi Christina B CB   Sowers Rosalie P RP   Moghe Gaurav D GD   Childs Kevin L KL   Shiu Shin-Han SH  

Scientific reports 20190820 1


Extensive transcriptional activity occurring in intergenic regions of genomes has raised the question whether intergenic transcription represents the activity of novel genes or noisy expression. To address this, we evaluated cross-species and post-duplication sequence and expression conservation of intergenic transcribed regions (ITRs) in four Poaceae species. Among 43,301 ITRs across the four species, 34,460 (80%) are species-specific. ITRs found across species tend to be more divergent in expr  ...[more]

Similar Datasets

| S-EPMC4316566 | biostudies-literature
| S-EPMC9139194 | biostudies-literature
| PRJEB44845 | ENA
2024-05-13 | GSE243549 | GEO
2003-10-30 | GSE639 | GEO
| S-EPMC10543033 | biostudies-literature
| S-EPMC1599769 | biostudies-literature
2010-06-05 | E-GEOD-639 | biostudies-arrayexpress
| S-EPMC10471719 | biostudies-literature
| S-EPMC1855179 | biostudies-literature