Unknown

Dataset Information

0

Subset quantile normalization using negative control features.


ABSTRACT: Normalization has been recognized as a necessary preprocessing step in a variety of high-throughput biotechnologies. A number of normalization methods have been developed specifically for microarrays, some general and others tailored for certain experimental designs. All methods rely on assumptions about data characteristics that are expected to stay constant across samples, although some make it more explicit than others. Most methods make assumptions that certain quantities related to the biological signal of interest stay the same; this is reasonable for many experiments but usually not verifiable. Recently, several platforms have begun to include a large number of negative control probes that nonetheless cover nearly the entire range of the measured signal intensity. Using these probes as a normalization basis makes it possible to normalize without making assumptions about the behavior of the biological signal. We present a subset quantile normalization (SQN) procedure that normalizes based on the distribution of non-specific control features, without restriction on the behavior of specific signals. We illustrate the performance of this method using three different platforms and experimental settings. Compared to two other leading nonlinear normalization procedures, the SQN method preserves more biological variation after normalization while reducing the noise observed on control features. Although the illustration datasets are from microarray experiments, this method is general for all high throughput technologies that include a large set of control features that have constant expectations across samples. It does not require an equal number of features in all samples and tolerates missing data.

SUBMITTER: Wu Z 

PROVIDER: S-EPMC3122888 | biostudies-literature | 2010 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Subset quantile normalization using negative control features.

Wu Zhijin Z   Aryee Martin J MJ  

Journal of computational biology : a journal of computational molecular cell biology 20101001 10


Normalization has been recognized as a necessary preprocessing step in a variety of high-throughput biotechnologies. A number of normalization methods have been developed specifically for microarrays, some general and others tailored for certain experimental designs. All methods rely on assumptions about data characteristics that are expected to stay constant across samples, although some make it more explicit than others. Most methods make assumptions that certain quantities related to the biol  ...[more]

Similar Datasets

| S-EPMC3446316 | biostudies-literature
| S-EPMC5862355 | biostudies-other
| S-EPMC3297825 | biostudies-literature
| S-EPMC5972664 | biostudies-literature
| S-EPMC7511327 | biostudies-literature
| S-EPMC7333325 | biostudies-literature
| S-EPMC6748729 | biostudies-literature
| S-EPMC7055659 | biostudies-literature
| S-EPMC6927181 | biostudies-literature
| S-EPMC3660216 | biostudies-literature