Dataset Information

Shape analysis of high-throughput transcriptomics experiment data.

ABSTRACT: The recent growth of high-throughput transcriptome technology has been paralleled by the development of statistical methodologies to analyze the data they produce. Some of these newly developed methods are based on the assumption that the data observed or a transformation of the data are relatively symmetric with light tails, usually summarized by assuming a Gaussian random component. It is indeed very difficult to assess this assumption for small sample sizes. In this article, we utilize L-moments statistics as the basis of exploratory data analysis, the assessment of distributional assumptions, and the hypothesis testing of high-throughput transcriptomic data. In particular, we use L-moments ratios for assessing the shape (skewness and kurtosis) of high-throughput transcriptome data. Based on these statistics, we propose an algorithm for identifying genes with distributions that are markedly different from the majority in the data. In addition, we also illustrate the utility of this framework to characterize the robustness of distributional assumptions. We apply it to RNA-seq data and find that methods based on the simple t-test for differential expression analysis using L-moments as weights are robust.
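
The L-moments ratios named in the abstract can be illustrated with a short sketch. The code below is not taken from the article or its software. It assumes the standard textbook estimator, which builds sample L-moments from unbiased probability-weighted moments of the sorted data. The function name `sample_l_moments` is hypothetical.

```python
def sample_l_moments(values):
    """Return (l1, l2, t3, t4): L-location, L-scale, L-skewness, L-kurtosis.

    Illustrative estimator based on unbiased probability-weighted moments;
    an assumption for this sketch, not the article's implementation.
    """
    x = sorted(float(v) for v in values)
    n = len(x)
    if n < 4:
        raise ValueError("need at least 4 observations for the fourth L-moment")

    # Accumulate probability-weighted moments b0..b3 over the order statistics.
    b0 = b1 = b2 = b3 = 0.0
    for i, xi in enumerate(x):  # i is the 0-based rank of xi
        w1 = i / (n - 1)
        w2 = w1 * (i - 1) / (n - 2)
        w3 = w2 * (i - 2) / (n - 3)
        b0 += xi
        b1 += w1 * xi
        b2 += w2 * xi
        b3 += w3 * xi
    b0, b1, b2, b3 = b0 / n, b1 / n, b2 / n, b3 / n

    # First four sample L-moments as linear combinations of b0..b3.
    l1 = b0
    l2 = 2 * b1 - b0
    l3 = 6 * b2 - 6 * b1 + b0
    l4 = 20 * b3 - 30 * b2 + 12 * b1 - b0
    if l2 == 0:
        raise ValueError("L-scale is zero; ratios are undefined for constant data")

    # L-moment ratios: shape summaries that do not depend on location or scale.
    return l1, l2, l3 / l2, l4 / l2


# Example with made-up values for a single gene across five samples.
l1, l2, t3, t4 = sample_l_moments([1, 2, 3, 4, 50])
```

In this sketch a symmetric sample gives an L-skewness close to zero, while a sample with one large value, as in the example, gives a positive L-skewness. Computing such ratios for each gene is one plausible way to compare its shape with the majority, in the spirit of the abstract.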

SUBMITTER: Okrah K

PROVIDER: S-EPMC4570582 | biostudies-literature | 2015 Oct

REPOSITORIES: biostudies-literature

Publications

Shape analysis of high-throughput transcriptomics experiment data.

Kwame Okrah, Héctor Corrada Bravo

Biostatistics (Oxford, England), 2015-05-11, issue 4

PMID: 25964664

Similar Datasets

Signature analysis of high-throughput transcriptomics screening data for mechanistic inference and chemical grouping.
| S-EPMC12261139 | biostudies-literature

ZetaSuite: computational analysis of two-dimensional high-throughput data from multi-target screens and single-cell transcriptomics.
| S-EPMC9310463 | biostudies-literature

Rapid planning and analysis of high-throughput experiment arrays for reaction discovery.
| S-EPMC10318092 | biostudies-literature

Heat*seq: an interactive web tool for high-throughput sequencing experiment comparison with public data.
| S-EPMC5079476 | biostudies-literature

Population-level comparisons of gene regulatory networks modeled on high-throughput single-cell transcriptomics data.
| S-EPMC10965443 | biostudies-literature

Methods for high-throughput MethylCap-Seq data analysis.
| S-EPMC3481483 | biostudies-literature

High-Throughput Transcriptomics Platform for Screening Environmental Chemicals.
| S-EPMC10194851 | biostudies-literature

Probabilistic cross-link analysis and experiment planning for high-throughput elucidation of protein structure.
| S-EPMC2287312 | biostudies-literature

Comprehensive analysis of high-throughput transcriptomics to distinguish drug-induced liver injury (DILI) phenotypes.
| S-EPMC12408718 | biostudies-literature

Analysis of High-Throughput Flow Cytometry Data Using plateCore.
| S-EPMC2777006 | biostudies-literature