
Dataset Information


DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs.


ABSTRACT: Although the information of cDNAs is indispensable for analyzing gene function, most of the cDNA sequences stored in current databases are imperfect in the sense that they lack the precise information of 5' end termini. To overcome this difficulty, we have developed the oligo-capping method to obtain full-length cDNAs, the information of which has been partly deposited in public databases. In this study, we further constructed human cDNA libraries enriched in clones containing the cap structure to systematically explore the 5' end structure of expressed genes. Of approximately 217 402 5' end sequences obtained, 111 382 have been matched to cDNA sequences of known genes (7889 genes) and are presented in our new database, DataBase of Transcriptional Start Sites (DBTSS; http://elmo.ims.u-tokyo.ac.jp/dbtss/). Sequence comparison between our entries and those of a reference sequence database, RefSeq, revealed that 4683 (34%) of RefSeq sequences should be extended towards the 5' ends. We also mapped each sequence on the human draft genome sequence to identify its transcriptional start site, which provides us with more detailed information on distribution patterns of transcriptional start sites and adjacent regulatory regions.
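The comparison described in the abstract, deciding whether a RefSeq entry "should be extended towards the 5' ends", can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `five_prime_extension`, the `min_overlap` parameter, and the exact-match anchoring are all assumptions made for illustration, and a real analysis would use a proper alignment tool that tolerates mismatches.

```python
from typing import Optional


def five_prime_extension(read: str, ref: str, min_overlap: int = 20) -> Optional[int]:
    """Estimate how far a 5' end read extends upstream of a reference cDNA.

    Hypothetical helper (not from the paper). Returns:
      > 0  : the read starts that many bases upstream of the reference start,
             so the reference would need extending at its 5' end;
      <= 0 : the read starts at, or downstream of, the reference start;
      None : no exact anchor of `min_overlap` bases was found.
    """
    read, ref = read.upper(), ref.upper()
    if len(read) < min_overlap or len(ref) < min_overlap:
        return None

    # Case 1: the first bases of the reference occur inside the read,
    # so everything in the read before that point lies upstream.
    pos = read.find(ref[:min_overlap])
    if pos >= 0:
        return pos

    # Case 2: the first bases of the read occur inside the reference,
    # so the read begins downstream of the reference start.
    pos = ref.find(read[:min_overlap])
    if pos >= 0:
        return -pos

    return None


# Toy sequences, invented for this example only.
reference = "ACGTTGCAAGGCTTAACCGGTTAGC"
capped_read = "GGGAT" + reference[:22]  # starts 5 bases upstream of the reference
print(five_prime_extension(capped_read, reference))
```

A positive return value marks a reference entry as a candidate for 5' extension; counting such entries over a whole reference set would give a figure of the same kind as the proportion quoted in the abstract.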

SUBMITTER: Suzuki Y 

PROVIDER: S-EPMC99097 | biostudies-literature | 2002 Jan

REPOSITORIES: biostudies-literature


Publications

DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs.

Yutaka Suzuki, Riu Yamashita, Kenta Nakai, Sumio Sugano

Nucleic Acids Research, 2002-01-01, issue 1



Similar Datasets

S-EPMC4176323 | biostudies-literature
GSE190930 | GEO | 2022-04-21
S-EPMC3245115 | biostudies-literature
S-EPMC2650624 | biostudies-literature
S-EPMC311163 | biostudies-literature
S-EPMC2780955 | biostudies-literature
S-EPMC1716722 | biostudies-literature
S-EPMC2774520 | biostudies-literature
S-EPMC2686583 | biostudies-literature
S-EPMC3095526 | biostudies-literature