Unknown

Dataset Information

0

Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.


ABSTRACT: BACKGROUND: Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. RESULTS: We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. CONCLUSIONS: Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

SUBMITTER: Tintle NL 

PROVIDER: S-EPMC3462729 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.

Tintle Nathan L NL   Sitarik Alexandra A   Boerema Benjamin B   Young Kylie K   Best Aaron A AA   Dejongh Matthew M  

BMC bioinformatics 20120808


<h4>Background</h4>Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed.<h4>Res  ...[more]

Similar Datasets

| S-EPMC8317122 | biostudies-literature
| S-EPMC5700673 | biostudies-literature
| S-EPMC8116545 | biostudies-literature
| S-EPMC2753845 | biostudies-other
| S-EPMC4652619 | biostudies-literature
| S-EPMC2711113 | biostudies-literature
| S-EPMC4625461 | biostudies-literature
| S-EPMC476733 | biostudies-literature
| S-EPMC10461906 | biostudies-literature
| S-EPMC3935204 | biostudies-other