Unknown

Dataset Information

0

Intergenic, gene terminal, and intragenic CpG islands in the human genome.


ABSTRACT:

Background

Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from transcription start-sites of any protein coding gene. The human genome contains many such CpG islands; however, until now their properties were not systematically studied.

Results

We studied CpG islands located in different regions of the human genome using methods of bioinformatics and comparative genomics. We have observed that CpG islands have a preference to overlap with exons, including exons located far from transcription start site, but usually extend well into introns. Synonymous substitution rate of CpG-containing codons becomes substantially reduced in regions where CpG islands overlap with protein-coding exons, even if they are located far downstream from transcription start site. CAGE tag analysis displayed frequent transcription start sites in all CpG islands, including those found far from transcription start sites of protein coding genes. Computational prediction and analysis of published ChIP-chip data revealed that CpG islands contain an increased number of sites recognized by Sp1 protein. CpG islands containing more CAGE tags usually also contain more Sp1 binding sites. This is especially relevant for CpG islands located in 3' gene regions. Various examples of transcription, confirmed by mRNAs or ESTs, but with no evidence of protein coding genes, were found in CAGE-enriched CpG islands located far from transcription start site of any known protein coding gene.

Conclusions

CpG islands located far from transcription start sites of protein coding genes have transcription initiation activity and display Sp1 binding properties. In exons, overlapping with these islands, the synonymous substitution rate of CpG containing codons is decreased. This suggests that these CpG islands are involved in transcription initiation, possibly of some non-coding RNAs.

SUBMITTER: Medvedeva YA 

PROVIDER: S-EPMC2817693 | biostudies-literature | 2010 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications


<h4>Background</h4>Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from transcription start-sites of any protein coding gene. The human genome contains many such CpG islands; however, until now their properties were not systematically studied.<h4>Results</h4>We stu  ...[more]

Similar Datasets

| S-EPMC7470969 | biostudies-literature
2017-12-30 | GSE82142 | GEO
| S-EPMC3129250 | biostudies-literature
| S-EPMC5347632 | biostudies-literature
| S-EPMC3256200 | biostudies-literature
2017-12-30 | GSE82125 | GEO
2017-12-30 | GSE81925 | GEO
| S-EPMC5594649 | biostudies-literature
| S-EPMC4084425 | biostudies-literature
| S-EPMC4237456 | biostudies-literature