Dataset Information

Power in pairs: assessing the statistical value of paired samples in tests for differential expression.

ABSTRACT: BACKGROUND: When genomics researchers design a high-throughput study to test for differential expression, some biological systems and research questions provide opportunities to use paired samples from subjects, and researchers can plan for a certain proportion of subjects to have paired samples. We consider the effect of this paired samples proportion on the statistical power of the study, using characteristics of both count (RNA-Seq) and continuous (microarray) expression data from a colorectal cancer study. RESULTS: We demonstrate that a higher proportion of subjects with paired samples yields higher statistical power, for various total numbers of samples, and for various strengths of subject-level confounding factors. In the design scenarios considered, the statistical power in a fully-paired design is substantially (and in many cases several times) greater than in an unpaired design. CONCLUSIONS: For the many biological systems and research questions where paired samples are feasible and relevant, substantial statistical power gains can be achieved at the study design stage when genomics researchers plan on using paired samples from the largest possible proportion of subjects. Any cost savings in a study design with unpaired samples are likely accompanied by underpowered and possibly biased results.
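The intuition behind the abstract's main claim can be illustrated with a small calculation. The sketch below is not the authors' method (the study works from RNA-Seq and microarray data characteristics); it assumes a single gene, a simple Gaussian model with a subject-level random effect, known variances, and a two-sided z-test. The function names and the parameters `sd_subject`, `sd_noise`, and `delta` are hypothetical and chosen only for illustration. Under these assumptions, pairing cancels the subject-level variation from the comparison, so the standard error shrinks and the power rises as the subject effect gets stronger.

```python
from math import sqrt
from statistics import NormalDist


def z_test_power(delta, se, alpha=0.05):
    """Power of a two-sided z-test for a true difference `delta`
    when the estimated difference has standard error `se`."""
    nd = NormalDist()
    z_crit = nd.inv_cdf(1 - alpha / 2)
    shift = abs(delta) / se
    # Probability that the test statistic lands in either rejection tail.
    return nd.cdf(shift - z_crit) + nd.cdf(-shift - z_crit)


def design_power(n_per_group, delta, sd_subject, sd_noise, paired, alpha=0.05):
    """Approximate power for one gene under an assumed Gaussian model.

    Paired: each of n_per_group subjects contributes both conditions,
    so the subject effect cancels in the within-subject difference.
    Unpaired: the two groups contain different subjects, so the
    subject-level variance stays in the comparison.
    """
    if paired:
        var_of_difference = 2 * sd_noise ** 2
    else:
        var_of_difference = 2 * (sd_subject ** 2 + sd_noise ** 2)
    se = sqrt(var_of_difference / n_per_group)
    return z_test_power(delta, se, alpha)


if __name__ == "__main__":
    # Illustrative values only; none of these numbers come from the study.
    for sd_subject in (0.0, 1.0, 2.0):
        p_paired = design_power(10, 1.0, sd_subject, 1.0, paired=True)
        p_unpaired = design_power(10, 1.0, sd_subject, 1.0, paired=False)
        print(f"sd_subject={sd_subject}: paired={p_paired:.3f}, unpaired={p_unpaired:.3f}")
```

Because this sketch treats the variances as known, it ignores the degrees of freedom that a paired t-test loses relative to an unpaired one, so it shows no cost to pairing when the subject effect is zero. A real power analysis for count data would need a model suited to that data type.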

SUBMITTER: Stevens JR

PROVIDER: S-EPMC6302489 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

Publications

Power in pairs: assessing the statistical value of paired samples in tests for differential expression.

Stevens JR, Herrick JS, Wolff RK, Slattery ML

BMC Genomics, 2018-12-20, issue 1

PMID: 30572829

Similar Datasets

Samples in many cell-based experiments are matched/paired but taking this into account does not always increase power of statistical tests for differences in means.
S-EPMC10881176 | biostudies-literature

Comparison of small n statistical tests of differential expression applied to microarrays.
S-EPMC2674054 | biostudies-literature

Comparison of Statistical Tests and Power Analysis for Phosphoproteomics Data.
S-EPMC8042666 | biostudies-literature

Experiment design beyond gut feeling: statistical tests and power to detect differential metabolites in mass spectrometry data.
2014-11-06 | MTBLS74 | MetaboLights

Statistical power estimation dataset for external validation GoF tests on EVT distribution.
S-EPMC6562228 | biostudies-literature

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations.
S-EPMC4877414 | biostudies-other

Comparing Statistical Tests for Differential Network Analysis of Gene Modules.
S-EPMC8170128 | biostudies-literature

Hierarchicell: an R-package for estimating power for tests of differential expression with single-cell data.
S-EPMC8088563 | biostudies-literature

Transcriptome from Paired Samples Improves the Power of Comprehensive COVID-19 Host-Viral Characterization.
S-EPMC10487753 | biostudies-literature

The level of residual dispersion variation and the power of differential expression tests for RNA-Seq data.
S-EPMC4388866 | biostudies-literature