Unknown

Dataset Information

0

Use of normalization methods for analysis of microarrays containing a high degree of gene effects.


ABSTRACT:

Background

High-throughput microarrays are widely used to study gene expression across tissues and developmental stages. Analysis of gene expression data is challenging in these experiments due to the presence of significant percentages of differentially expressed genes (DEG) observed between tissues and developmental stages. Data normalization methods that are widely used today are not designed for data with a large proportion of tissue or gene effects.

Results

In our current study, we describe a novel two-dimensional nonparametric normalization method for analyzing microarray data which functions well in the absence or presence of large numbers of gene effects. Rather than relying on an assumption of low variability among most genes, the method implements a unique peak selection strategy to distinguish DEG from genes that are invariant in expression, prior to nonlinear curve fitting. We compared the method under simulated and experimental conditions with five alternative nonlinear normalization approaches: quantile, lowess, robust lowess, invariant set, and cross-correlation (Xcorr). Simulations included various percentages of simulated DEG and the experimental data used is from publicly available datasets known to be difficult to analyze due to the presence of approximately 34% DEG.

Conclusion

We have demonstrated that the new method provides considerable improvement in the accuracy of data normalization when large proportions of gene effects are present. The performance improvement is mostly attributed to its variable selection component, which is designed to separate expression invariant genes from DEG. Adding this key component of the new method to alternative normalization approaches rescues the most of the sensitivity of these methods to gene effects. The results indicate that our method may be used without prior knowledge of or assumptions about housekeeping genes to normalize microarrays that are quite different.

SUBMITTER: Ni TT 

PROVIDER: S-EPMC2612699 | biostudies-literature | 2008 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Use of normalization methods for analysis of microarrays containing a high degree of gene effects.

Ni Terri T TT   Lemon William J WJ   Shyr Yu Y   Zhong Tao P TP  

BMC bioinformatics 20081128


<h4>Background</h4>High-throughput microarrays are widely used to study gene expression across tissues and developmental stages. Analysis of gene expression data is challenging in these experiments due to the presence of significant percentages of differentially expressed genes (DEG) observed between tissues and developmental stages. Data normalization methods that are widely used today are not designed for data with a large proportion of tissue or gene effects.<h4>Results</h4>In our current stu  ...[more]

Similar Datasets

| S-EPMC2262854 | biostudies-literature
| S-EPMC2917409 | biostudies-literature
| S-EPMC3229535 | biostudies-literature
| S-EPMC2865860 | biostudies-literature
| S-EPMC8388031 | biostudies-literature
| S-EPMC105386 | biostudies-literature
| S-EPMC5910605 | biostudies-literature
| S-EPMC3236842 | biostudies-literature
| S-EPMC3358021 | biostudies-literature
| S-EPMC4625728 | biostudies-literature