Dataset Information

SoyDB: a knowledge database of soybean transcription factors.

ABSTRACT: BACKGROUND:Transcription factors play the crucial rule of regulating gene expression and influence almost all biological processes. Systematically identifying and annotating transcription factors can greatly aid further understanding their functions and mechanisms. In this article, we present SoyDB, a user friendly database containing comprehensive knowledge of soybean transcription factors. DESCRIPTION:The soybean genome was recently sequenced by the Department of Energy-Joint Genome Institute (DOE-JGI) and is publicly available. Mining of this sequence identified 5,671 soybean genes as putative transcription factors. These genes were comprehensively annotated as an aid to the soybean research community. We developed SoyDB - a knowledge database for all the transcription factors in the soybean genome. The database contains protein sequences, predicted tertiary structures, putative DNA binding sites, domains, homologous templates in the Protein Data Bank (PDB), protein family classifications, multiple sequence alignments, consensus protein sequence motifs, web logo of each family, and web links to the soybean transcription factor database PlantTFDB, known EST sequences, and other general protein databases including Swiss-Prot, Gene Ontology, KEGG, EMBL, TAIR, InterPro, SMART, PROSITE, NCBI, and Pfam. The database can be accessed via an interactive and convenient web server, which supports full-text search, PSI-BLAST sequence search, database browsing by protein family, and automatic classification of a new protein sequence into one of 64 annotated transcription factor families by hidden Markov models. CONCLUSIONS:A comprehensive soybean transcription factor database was constructed and made publicly accessible at http://casp.rnet.missouri.edu/soydb/.

SUBMITTER: Wang Z

PROVIDER: S-EPMC2826334 | biostudies-literature | 2010 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

SoyDB: a knowledge database of soybean transcription factors.

Wang Zheng Z Libault Marc M Joshi Trupti T Valliyodan Babu B Nguyen Henry T HT Xu Dong D Stacey Gary G Cheng Jianlin J

BMC plant biology 20100118

<h4>Background</h4>Transcription factors play the crucial rule of regulating gene expression and influence almost all biological processes. Systematically identifying and annotating transcription factors can greatly aid further understanding their functions and mechanisms. In this article, we present SoyDB, a user friendly database containing comprehensive knowledge of soybean transcription factors.<h4>Description</h4>The soybean genome was recently sequenced by the Department of Energy-Joint Ge ...[more]

PMID: 20082720

Similar Datasets

Project description:Background: Many tools used to analyze microarrays in different conditions have been described. However, the integration of the deregulated genes within coherent metabolic pathways is lacking. Currently no objective selection criterion, based on biological functions exists, to determine a threshold demonstrating that a gene is indeed differentially expressed. Methodology/Principal Findings: To improve transcriptomic analysis of microarrays, we propose a new statistical approach, which takes into account biological parameters. We present an iterative method to optimise the selection of differentially expressed gene in two experimental conditions. The stringency level of gene selection was associated simultaneously with the p-value of expression variation and the occurrence rate parameter, which is associated with the percentage of donors whose transcriptomic profile is similar. Our method intertwines stringency level settings, biological data and a knowledge database to highlight molecular interactions using networks and pathways. Analysis performed during iterations helped us select the optimal threshold required for the most pertinent selection of differently expressed genes. Conclusions/significance: We have applied this approach to the well documented mechanism of human macrophage response to lipopolysaccharide stimulation. For example, we thus verified that our method was able to determine with the highest degree of accuracy the best threshold for selecting genes, which are truly differentially expressed. Macrophages isolated from six heathy donnor was/or not stimulated. Paired data, i.e. LPS stimulated macrophages versus unstimulated macrophages from the same donor have been compared (eg, Donor1_LPS vs Donor1_NT; see processed data file linked below). The six comparaisons have been globaly analyse using two parameters, i.e. threshod and occurency, associated with a request of a database knowledge. Both parameters has been tune to define the best setting allowing to optimize the selection of differentially expressed genes

Dataset Information

SoyDB: a knowledge database of soybean transcription factors.

Publications

SoyDB: a knowledge database of soybean transcription factors.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets