
Dataset Information


The Co-regulation Data Harvester: automating gene annotation starting from a transcriptome database.


ABSTRACT: Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing per se. Tetrahymena thermophila, a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in T. thermophila, called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the Tetrahymena transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in Tetrahymena. Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained via the CDH should be relevant, and can be explored, in many other systems.
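The ortholog step described in the abstract relies on reciprocal BLAST searches. The following is a minimal sketch of the general reciprocal-best-hit idea followed by annotation collation, not the CDH's actual code. It assumes the BLAST results have already been parsed into in-memory hit lists; the gene identifiers, scores, annotation text, and the function names `best_hit`, `reciprocal_best_hits`, and `collate_annotations` are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical stand-in for parsed BLAST output:
# query ID -> list of (subject ID, bit score) hits.
Hits = Dict[str, List[Tuple[str, float]]]


def best_hit(hits: Hits, query: str) -> Optional[str]:
    """Return the subject with the highest score for `query`, or None if there are no hits."""
    candidates = hits.get(query, [])
    if not candidates:
        return None
    return max(candidates, key=lambda hit: hit[1])[0]


def reciprocal_best_hits(forward: Hits, reverse: Hits) -> Dict[str, str]:
    """Keep only pairs where each gene is the other's best hit."""
    pairs: Dict[str, str] = {}
    for query in forward:
        subject = best_hit(forward, query)
        if subject is not None and best_hit(reverse, subject) == query:
            pairs[query] = subject
    return pairs


def collate_annotations(pairs: Dict[str, str], annotations: Dict[str, str]) -> Dict[str, str]:
    """Map each query gene to the annotation of its putative ortholog."""
    return {query: annotations.get(subject, "unannotated") for query, subject in pairs.items()}


# Illustrative data only (assumed, not taken from any real database).
forward = {
    "tt_gene_A": [("org_gene_1", 250.0), ("org_gene_2", 90.0)],
    "tt_gene_B": [("org_gene_2", 120.0)],
}
reverse = {
    "org_gene_1": [("tt_gene_A", 240.0)],
    "org_gene_2": [("tt_gene_A", 95.0), ("tt_gene_B", 80.0)],
}
annotations = {"org_gene_1": "example annotation"}

pairs = reciprocal_best_hits(forward, reverse)
collated = collate_annotations(pairs, annotations)
```

In this toy input, `tt_gene_B` is dropped because its best hit, `org_gene_2`, points back to `tt_gene_A` rather than to `tt_gene_B`; only the mutually best pair survives and receives an annotation.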

SUBMITTER: Tsypin LM 

PROVIDER: S-EPMC5663188 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature


Publications

The Co-regulation Data Harvester: automating gene annotation starting from a transcriptome database.

Lev M Tsypin, Aaron P Turkewitz

SoftwareX, 2017-08-16



Similar Datasets

| S-EPMC4303599 | biostudies-literature
| S-EPMC6323909 | biostudies-other
| S-EPMC11245099 | biostudies-literature
| S-EPMC9669643 | biostudies-literature
| S-EPMC6668405 | biostudies-literature
| S-EPMC5072840 | biostudies-literature
| S-EPMC1965490 | biostudies-literature
2013-09-19 | GSE43892 | GEO
2016-07-05 | GSE77776 | GEO
| S-EPMC8022709 | biostudies-literature