Unknown

Dataset Information

0

Long-read cDNA sequencing identifies functional pseudogenes in the human transcriptome.


ABSTRACT: Pseudogenes are gene copies presumed to mainly be functionless relics of evolution due to acquired deleterious mutations or transcriptional silencing. Using deep full-length PacBio cDNA sequencing of normal human tissues and cancer cell lines, we identify here hundreds of novel transcribed pseudogenes expressed in tissue-specific patterns. Some pseudogene transcripts have intact open reading frames and are translated in cultured cells, representing unannotated protein-coding genes. To assess the biological impact of noncoding pseudogenes, we CRISPR-Cas9 delete the nucleus-enriched pseudogene PDCL3P4 and observe hundreds of perturbed genes. This study highlights pseudogenes as a complex and dynamic component of the human transcriptional landscape.

SUBMITTER: Troskie RL 

PROVIDER: S-EPMC8108447 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2021-04-26 | GSE160383 | GEO
| PRJNA673144 | ENA
| S-EPMC6664245 | biostudies-literature
| S-EPMC4915659 | biostudies-literature
| S-EPMC7596999 | biostudies-literature
2022-05-31 | PXD031213 | Pride
| S-EPMC6240054 | biostudies-literature
| S-EPMC8739696 | biostudies-literature
| S-EPMC4176000 | biostudies-literature
| S-EPMC5550469 | biostudies-other