
Dataset Information

A factor model to analyze heterogeneity in gene expression.


ABSTRACT:

Background

Microarray technology allows the simultaneous analysis of thousands of genes within a single experiment. Standard significance analyses of transcriptomic data ignore the gene dependence structure. The resulting correlation among test statistics undermines strong control of the false discovery proportion. A recent method called FAMT captures the gene dependence in factors in order to improve high-dimensional multiple testing procedures. For the subsequent analyses, which aim at a functional characterization of the differentially expressed genes, our study shows how these factors can be used both to identify the components of expression heterogeneity and to give more insight into the underlying biological processes.
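To make the idea of capturing gene dependence in factors concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the FAMT procedure: a plain SVD of the per-gene regression residuals stands in for FAMT's own factor estimation, which is not reproduced here. The function names (`ols_residuals`, `factor_adjust`, `mean_abs_offdiag_corr`), the single covariate `x`, and the simulated data are all hypothetical.

```python
import numpy as np

def ols_residuals(Y, x):
    """Residuals of each gene (column of Y) regressed on an intercept and x."""
    X = np.column_stack([np.ones(len(x)), x])
    beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return Y - X @ beta

def factor_adjust(Y, x, n_factors):
    """Remove the leading residual factors from Y (samples x genes).

    Assumption: the factors are taken as the top singular vectors of the
    regression residuals; this is a simplification chosen for illustration.
    """
    resid = ols_residuals(Y, x)
    U, s, Vt = np.linalg.svd(resid, full_matrices=False)
    Z = U[:, :n_factors] * s[:n_factors]   # factor scores, one row per sample
    B = Vt[:n_factors].T                   # loadings, one row per gene
    return Y - Z @ B.T, Z, B

def mean_abs_offdiag_corr(M):
    """Average absolute correlation between distinct columns of M."""
    C = np.corrcoef(M, rowvar=False)
    off = C[~np.eye(C.shape[0], dtype=bool)]
    return float(np.mean(np.abs(off)))

# Simulated example: one hidden factor shared by all genes.
rng = np.random.default_rng(0)
n_samples, n_genes = 40, 100
x = np.repeat([0.0, 1.0], n_samples // 2)   # two experimental groups
hidden = rng.normal(size=n_samples)         # unobserved source of heterogeneity
loadings = rng.normal(size=n_genes)
Y = np.outer(hidden, loadings) + rng.normal(size=(n_samples, n_genes))

Y_adj, Z, B = factor_adjust(Y, x, n_factors=1)
print("mean |corr| before:", mean_abs_offdiag_corr(ols_residuals(Y, x)))
print("mean |corr| after: ", mean_abs_offdiag_corr(ols_residuals(Y_adj, x)))
```

Because the estimated factors are built from residuals, they are orthogonal to the design, so the per-gene effect estimates for `x` are unchanged by the adjustment; only the shared residual structure is removed. The columns of `Z` are the quantities one would then compare with known experimental or gene annotations.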

Results

The use of factors to characterize simple patterns of heterogeneity is first demonstrated on illustrative gene expression data sets. An expression data set primarily generated to map QTL for fatness in chickens is then analyzed. In contrast to the analysis based on the raw data, factor adjustment of the gene expressions reveals relevant functional information about a QTL region. Additionally, interpreting the independent factors in light of known information about both the experimental design and the genes shows that some factors may have different and complex origins.

Conclusions

As biological information and technological biases are identified in what was previously treated simply as statistical noise, analyzing heterogeneity in gene expression yields a new point of view on transcriptomic data.

SUBMITTER: Blum Y 

PROVIDER: S-EPMC2911460 | biostudies-literature | 2010 Jul

REPOSITORIES: biostudies-literature

Publications

A factor model to analyze heterogeneity in gene expression.

Blum Yuna, Le Mignon Guillaume, Lagarrigue Sandrine, Causeur David

BMC Bioinformatics, 2010-07-02



Similar Datasets

| S-EPMC3117390 | biostudies-literature
| S-EPMC5615240 | biostudies-literature
| S-EPMC2696132 | biostudies-literature
| S-EPMC7305041 | biostudies-literature
| S-EPMC3459542 | biostudies-literature
| S-EPMC440605 | biostudies-literature
| S-EPMC10809200 | biostudies-literature
| S-EPMC7301895 | biostudies-literature
| S-EPMC8415427 | biostudies-literature
| S-EPMC6075455 | biostudies-literature