Dataset Information

Genome-wide survey of pseudogenes in 80 fully re-sequenced Arabidopsis thaliana accessions.


ABSTRACT: Pseudogenes (Ψs), including processed and non-processed Ψs, are ubiquitous genetic elements derived from originally functional genes in all studied genomes within the three kingdoms of life. However, systematic surveys of non-processed Ψs utilizing genomic information from multiple samples within a species are still rare. Here a systematic comparative analysis was conducted of Ψs within 80 fully re-sequenced Arabidopsis thaliana accessions, and 7546 genes, representing ∼28% of the genomic annotated open reading frames (ORFs), were found with disruptive mutations in at least one accession. The distribution of these Ψs on chromosomes showed a significantly negative correlation between Ψs/ORFs and their local gene densities, suggesting a higher proportion of Ψs in gene desert regions, e.g. near centromeres. On the other hand, compared with the non-Ψ loci, even the intact coding sequences (CDSs) in the Ψ loci were found to have shorter CDS length, fewer exon number and lower GC content. In addition, a significant functional bias against the null hypothesis was detected in the Ψs mainly involved in responses to environmental stimuli and biotic stress as reported, suggesting that they are likely important for adaptive evolution to rapidly changing environments by pseudogenization to accumulate successive mutations.

SUBMITTER: Wang L 

PROVIDER: S-EPMC3521719 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

Publications

Genome-wide survey of pseudogenes in 80 fully re-sequenced Arabidopsis thaliana accessions.

Wang Long, Si Weina, Yao Yongfang, Tian Dacheng, Araki Hitoshi, Yang Sihai

PLoS One, 2012-12-13, issue 12


Similar Datasets

| S-EPMC4965157 | biostudies-literature
| S-EPMC4444734 | biostudies-literature
| S-EPMC3267885 | biostudies-literature
| S-EPMC403797 | biostudies-literature
2018-11-14 | ST001096 | MetabolomicsWorkbench
| S-EPMC5172462 | biostudies-literature
| S-EPMC4648415 | biostudies-literature
| S-EPMC4978887 | biostudies-literature
| S-EPMC4979618 | biostudies-other