Unknown

Dataset Information

0

A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes.


ABSTRACT:

Motivation

Retrocopies are important genes in the genomes of almost all higher eukaryotes. However, the annotation of such genes is a non-trivial task. Intronless genes have often been considered to be retroposed copies of intron-containing paralogs. Such categorization relies on the implicit premise that alignable regions of the duplicates should be long enough to cover exon-exon junctions of the intron-containing genes, and thus intron loss events can be inferred. Here, we examined the alternative possibility that intronless genes could be generated by partial DNA-based duplication of intron-containing genes in the fruitfly genome.

Results

By building pairwise protein-, transcript- and genome-level DNA alignments between intronless genes and their corresponding intron-containing paralogs, we found that alignments do not cover exon-exon junctions in 40% of cases and thus no intron loss could be inferred. For these cases, the candidate parental proteins tend to be partially duplicated, and intergenic sequences or neighboring genes are included in the intronless paralog. Moreover, we observed that it is significantly less likely for these paralogs to show inter-chromosomal duplication and testis-dominant transcription, compared to the remaining 60% of cases with evidence of clear intron loss (retrogenes). These lines of analysis reveal that DNA-based duplication contributes significantly to the 40% of cases of single exon gene duplication. Finally, we performed an analogous survey in the human genome and the result is similar, wherein 34% of the cases do not cover exon-exon junctions. Thus, genome annotation for retrogene identification should discard candidates without clear evidence of intron loss.

Contact

mlong@uchicago.edu; zhangy@uchicago.edu

SUBMITTER: Zhang YE 

PROVIDER: S-EPMC3117337 | biostudies-literature | 2011 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes.

Zhang Yong E YE   Vibranovski Maria D MD   Krinsky Benjamin H BH   Long Manyuan M  

Bioinformatics (Oxford, England) 20110505 13


<h4>Motivation</h4>Retrocopies are important genes in the genomes of almost all higher eukaryotes. However, the annotation of such genes is a non-trivial task. Intronless genes have often been considered to be retroposed copies of intron-containing paralogs. Such categorization relies on the implicit premise that alignable regions of the duplicates should be long enough to cover exon-exon junctions of the intron-containing genes, and thus intron loss events can be inferred. Here, we examined the  ...[more]

Similar Datasets

| S-EPMC3268293 | biostudies-literature
| S-EPMC4635201 | biostudies-literature
| S-EPMC5619393 | biostudies-literature
| S-EPMC2140055 | biostudies-literature
| S-EPMC5477984 | biostudies-literature
| S-EPMC3172298 | biostudies-literature
| S-EPMC2891723 | biostudies-literature
| S-EPMC7885797 | biostudies-literature
| S-EPMC334893 | biostudies-other
| S-EPMC6385490 | biostudies-literature