Dataset Information

Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes.


ABSTRACT: Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may play regulatory roles in gene expression. CNSs in crop genomes could be linked to many important agronomic traits and ecological adaptations. In contrast to the relatively mature protocols for exon annotation, efficient methods for predicting the location of noncoding sequences in plant genomes are lacking. We implemented a computational pipeline tailored to comparisons of plant genomes, which yielded a large number of conserved sequences using the rice genome as the reference. In this study, we used 17 published grass genomes, along with five monocot genomes and the basal angiosperm genome of Amborella trichopoda. Alignments among these genomes suggest that at least 12.05% of the rice genome is evolving under constraint in the Poaceae lineage, with close to half of the evolutionarily constrained sequences located outside protein-coding regions. We found evidence for purifying selection acting on the conserved sequences by analyzing segregating SNPs within the rice population. Furthermore, we found that known functional motifs were significantly enriched within CNSs, with many motifs associated with the preferred binding of ubiquitous transcription factors. The conserved elements that we have curated are accessible through our public database and the JBrowse server. In-depth functional annotations and evolutionary dynamics of the identified conserved sequences provide a solid foundation for studying gene regulation and genome evolution, and for informing gene isolation by cereal biologists.

SUBMITTER: Liang P 

PROVIDER: S-EPMC5798027 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

Publications

Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes.

Liang Pingping, Saqib Hafiz Sohaib Ahmed, Zhang Xingtan, Zhang Liangsheng, Tang Haibao

Genome Biology and Evolution, 2018-02-01, issue 2


Similar Datasets

| S-EPMC3579700 | biostudies-literature
| S-EPMC2798825 | biostudies-literature
| S-EPMC6785632 | biostudies-literature
| S-EPMC5373346 | biostudies-literature
2009-10-21 | E-GEOD-18630 | biostudies-arrayexpress
| S-EPMC1182216 | biostudies-literature
| S-EPMC8374106 | biostudies-literature
| S-EPMC3281118 | biostudies-literature
| S-EPMC3786739 | biostudies-literature
| S-EPMC2579482 | biostudies-literature