Unknown

Dataset Information

0

Using control genes to correct for unwanted variation in microarray data.


ABSTRACT: Microarray expression studies suffer from the problem of batch effects and other unwanted variation. Many methods have been proposed to adjust microarray data to mitigate the problems of unwanted variation. Several of these methods rely on factor analysis to infer the unwanted variation from the data. A central problem with this approach is the difficulty in discerning the unwanted variation from the biological variation that is of interest to the researcher. We present a new method, intended for use in differential expression studies, that attempts to overcome this problem by restricting the factor analysis to negative control genes. Negative control genes are genes known a priori not to be differentially expressed with respect to the biological factor of interest. Variation in the expression levels of these genes can therefore be assumed to be unwanted variation. We name this method "Remove Unwanted Variation, 2-step" (RUV-2). We discuss various techniques for assessing the performance of an adjustment method and compare the performance of RUV-2 with that of other commonly used adjustment methods such as Combat and Surrogate Variable Analysis (SVA). We present several example studies, each concerning genes differentially expressed with respect to gender in the brain and find that RUV-2 performs as well or better than other methods. Finally, we discuss the possibility of adapting RUV-2 for use in studies not concerned with differential expression and conclude that there may be promise but substantial challenges remain.

SUBMITTER: Gagnon-Bartsch JA 

PROVIDER: S-EPMC3577104 | biostudies-literature | 2012 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using control genes to correct for unwanted variation in microarray data.

Gagnon-Bartsch Johann A JA   Speed Terence P TP  

Biostatistics (Oxford, England) 20111117 3


Microarray expression studies suffer from the problem of batch effects and other unwanted variation. Many methods have been proposed to adjust microarray data to mitigate the problems of unwanted variation. Several of these methods rely on factor analysis to infer the unwanted variation from the data. A central problem with this approach is the difficulty in discerning the unwanted variation from the biological variation that is of interest to the researcher. We present a new method, intended fo  ...[more]

Similar Datasets

| S-EPMC2640357 | biostudies-literature
| S-EPMC5798764 | biostudies-literature
| S-EPMC4544854 | biostudies-literature
| S-EPMC4652745 | biostudies-literature
| S-EPMC8371158 | biostudies-literature
| S-EPMC3128035 | biostudies-literature
| S-EPMC1933513 | biostudies-literature
| S-EPMC8346590 | biostudies-literature
| S-EPMC3125784 | biostudies-literature
| S-EPMC4679071 | biostudies-literature