Dataset Information

Genomic fossils as a snapshot of the human transcriptome.


ABSTRACT: Processed pseudogenes (PPGs) are cDNA sequences that were generated through reverse transcription of mature, spliced mRNAs and have subsequently been reinserted at a new genomic location. These cDNA sequences are usually no longer transcribed and are considered "dead on arrival." Here we show that PPGs can be used to generate a map of the transcriptome. By analyzing thousands of human PPGs, we were able to discover hundreds of transcript variants so far unidentified. An experimental verification of a subset of these variants by RT-PCR indicates that most of them are still active in the human transcriptome. Furthermore, we demonstrate that PPGs can enable the identification of ancient splice variants that were expressed ancestrally but are now extinct. Our results show that the genome itself carries a "virtual cDNA library" that can readily be used to analyze both present and ancestral transcripts. Our approach can be applied to sequenced metazoan genomes to computationally annotate splicing variation even when expressed sequences are unavailable.

SUBMITTER: Shemesh R 

PROVIDER: S-EPMC1360558 | biostudies-literature | 2006 Jan

REPOSITORIES: biostudies-literature

Publications

Genomic fossils as a snapshot of the human transcriptome.

Ronen Shemesh, Amit Novik, Sarit Edelheit, Rotem Sorek

Proceedings of the National Academy of Sciences of the United States of America, 2006-01-23, issue 5


Similar Datasets

PRJEB2131 | ENA
S-EPMC2946954 | biostudies-literature
S-EPMC4046930 | biostudies-literature
S-EPMC8478212 | biostudies-literature
S-EPMC2602669 | biostudies-literature
S-EPMC39958 | biostudies-other
S-EPMC8927662 | biostudies-literature
S-EPMC8790175 | biostudies-literature
S-EPMC5491270 | biostudies-literature
S-EPMC3970191 | biostudies-literature