Unknown

Dataset Information

0

Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.


ABSTRACT: Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from an homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences that may serve as functional genomic elements in the development of diverse genes.

SUBMITTER: Delihas N 

PROVIDER: S-EPMC7098633 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.

Delihas Nicholas N  

PloS one 20200326 3


Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRN  ...[more]

Similar Datasets

| S-EPMC4586776 | biostudies-other
| S-EPMC4460805 | biostudies-other
2022-08-12 | PXD033707 | Pride
| S-EPMC5577775 | biostudies-literature
| S-EPMC8139166 | biostudies-literature
| S-EPMC5579197 | biostudies-literature
| S-EPMC3699063 | biostudies-literature
| S-EPMC5312340 | biostudies-literature
| S-EPMC5889127 | biostudies-literature