
Dataset Information


Identification and characterization of the potential promoter regions of 1031 kinds of human genes.


ABSTRACT: To understand the mechanism of transcriptional regulation, it is essential to identify and characterize the promoter, which is located proximal to the mRNA start site. To identify the promoters from the large volumes of genomic sequences, we used mRNA start sites determined by a large-scale sequencing of the cDNA libraries constructed by the "oligo-capping" method. We aligned the mRNA start sites with the genomic sequences and retrieved adjacent sequences as potential promoter regions (PPRs) for 1031 genes. The PPR sequences were searched to determine the frequencies of major promoter elements. Among 1031 PPRs, 329 (32%) contained TATA boxes, 872 (85%) contained initiators, 999 (97%) contained GC boxes, and 663 (64%) contained CAAT boxes. Furthermore, 493 (48%) PPRs were located in CpG islands. This frequency of CpG islands was reduced in TATA(+)/Inr(+) PPRs and in the PPRs of ubiquitously expressed genes. In the PPRs of the CGM2 gene, the DRA gene, and the TM30pl genes, which showed highly colon specific expression patterns, the consensus sequences of E boxes were commonly observed. The PPRs were also useful for exploring promoter SNPs.
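
The abstract does not say how the PPR sequences were searched or which CpG island criteria were applied. The sketch below is only one plausible way to run such a survey. The regular expressions are simplified textbook consensus patterns, and the CpG thresholds (length of at least 200 bp, GC content above 50%, observed/expected CpG ratio above 0.6) are commonly used defaults. Both are assumptions, not the authors' method. All function and variable names are hypothetical.

```python
import re

# Simplified consensus patterns, chosen for illustration only (assumption:
# the study may have used different consensus sequences or weight matrices).
MOTIFS = {
    "TATA box": re.compile(r"TATA[AT]A"),
    "Initiator": re.compile(r"[CT][CT]A[ACGT][AT][CT][CT]"),
    "GC box": re.compile(r"GGGCGG|CCGCCC"),   # either strand
    "CAAT box": re.compile(r"CCAAT|ATTGG"),   # either strand
}


def elements_present(seq):
    """Report which promoter elements match at least once in one sequence."""
    seq = seq.upper()
    return {name: bool(pattern.search(seq)) for name, pattern in MOTIFS.items()}


def cpg_stats(seq):
    """Return (GC content, observed/expected CpG ratio) for one sequence."""
    seq = seq.upper()
    n = len(seq)
    c, g = seq.count("C"), seq.count("G")
    cpg = seq.count("CG")
    gc_content = (c + g) / n if n else 0.0
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_content, obs_exp


def looks_like_cpg_island(seq, min_len=200, min_gc=0.5, min_ratio=0.6):
    """Apply the assumed length, GC-content, and CpG-ratio thresholds."""
    gc_content, obs_exp = cpg_stats(seq)
    return len(seq) >= min_len and gc_content > min_gc and obs_exp > min_ratio


def frequency_table(pprs):
    """Map each element to (count, rounded percent) over a dict of id -> sequence."""
    total = len(pprs)
    counts = {name: 0 for name in MOTIFS}
    for seq in pprs.values():
        for name, hit in elements_present(seq).items():
            counts[name] += hit  # True counts as 1
    return {name: (k, round(100 * k / total)) for name, k in counts.items()}
```

The same rounding reproduces the percentages quoted in the abstract, for example `round(100 * 329 / 1031)` gives 32 for the TATA box count.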

SUBMITTER: Suzuki Y 

PROVIDER: S-EPMC311086 | biostudies-other | 2001 May

REPOSITORIES: biostudies-other


Publications

Identification and characterization of the potential promoter regions of 1031 kinds of human genes.

Suzuki Y, Tsunoda T, Sese J, Taira H, Mizushima-Sugano J, Hata H, Ota T, Isogai T, Tanaka T, Nakamura Y, Suyama A, Sakaki Y, Morishita S, Okubo K, Sugano S

Genome Research, 2001-05-01, issue 5



Similar Datasets

| S-EPMC3408873 | biostudies-literature
| S-EPMC1223041 | biostudies-other
| S-EPMC2781476 | biostudies-literature
| S-EPMC4085640 | biostudies-literature
| S-EPMC4839604 | biostudies-literature
| S-EPMC1360089 | biostudies-literature
| S-EPMC1557771 | biostudies-literature
| S-EPMC4396557 | biostudies-literature
| S-EPMC2963359 | biostudies-literature
| S-EPMC6135729 | biostudies-literature