Dataset Information

The Diatom EST Database.


ABSTRACT: The Diatom EST Database provides integrated access to expressed sequence tag (EST) data from two eukaryotic microalgae of the class Bacillariophyceae, Phaeodactylum tricornutum and Thalassiosira pseudonana. The database currently contains close to 30,000 EST sequences organized into PtDB, the P. tricornutum EST database, and TpDB, the T. pseudonana EST database. The EST sequences were clustered and assembled into a non-redundant set for each organism, and these non-redundant sequences were then annotated automatically using similarity searches against protein and domain databases. The Diatom EST Database gives access to the EST sequences, the clusters of contiguous sequences, their annotation and analysis with reference to publicly available databases, and a codon usage table derived from a subset of sequences from PtDB and TpDB. The underlying RDBMS supports queries over the raw and annotated EST data, and a web interface offers keyword and BLAST searches for retrieving information. The EST data can also be retrieved by the Pfam domains, Clusters of Orthologous Groups (COG) and Gene Ontology (GO) terms assigned to them by similarity searches. The database is available at http://avesthagen.sznbowler.com.
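
The abstract mentions a codon usage table derived from a subset of sequences but does not say how it was computed. The following is only a minimal sketch of one common way to build such a table, assuming the inputs are in-frame coding sequences; the function name `codon_usage` and the toy sequences are hypothetical and do not come from the database.

```python
from collections import Counter

def codon_usage(cds_sequences):
    """Return relative codon frequencies for in-frame coding sequences.

    Assumption: each sequence starts at codon position 1; this is an
    illustration, not the procedure used by the Diatom EST Database.
    """
    counts = Counter()
    for seq in cds_sequences:
        seq = seq.upper().replace("U", "T")
        # Walk the sequence in steps of three, ignoring a trailing partial codon.
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            # Skip codons containing ambiguous bases such as N.
            if set(codon) <= set("ACGT"):
                counts[codon] += 1
    total = sum(counts.values())
    return {codon: n / total for codon, n in counts.items()} if total else {}

# Toy input for illustration only, not real diatom data.
usage = codon_usage(["ATGGCTGCTTAA", "ATGGCNGCTTGA"])
```

Frequencies here are fractions of all counted codons; a per-amino-acid table would additionally group codons by the residue they encode.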

SUBMITTER: Maheswari U 

PROVIDER: S-EPMC540075 | biostudies-literature | 2005 Jan

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2686495 | biostudies-literature
| S-EPMC308870 | biostudies-literature
| S-EPMC2221943 | biostudies-literature
| S-EPMC2483293 | biostudies-literature
| S-EPMC2034596 | biostudies-literature
| S-EPMC3036138 | biostudies-literature
| PRJEB74140 | ENA
| S-EPMC1539033 | biostudies-literature
| S-EPMC1885842 | biostudies-literature
2012-03-23 | GSE34322 | GEO