Unknown

Dataset Information

0

Pathway analysis for RNA-Seq data using a score-based approach.


ABSTRACT: A variety of pathway/gene-set approaches have been proposed to provide evidence of higher-level biological phenomena in the association of expression with experimental condition or clinical outcome. Among these approaches, it has been repeatedly shown that resampling methods are far preferable to approaches that implicitly assume independence of genes. However, few approaches have been optimized for the specific characteristics of RNA-Seq transcription data, in which mapped tags produce discrete counts with varying library sizes, and with potential outliers or skewness patterns that violate parametric assumptions. We describe transformations to RNA-Seq data to improve power for linear associations with outcome and flexibly handle normalization factors. Using these transformations or alternate transformations, we apply recently developed null approximations to quadratic form statistics for both self-contained and competitive pathway testing. The approach provides a convenient integrated platform for RNA-Seq pathway testing. We demonstrate that the approach provides appropriate type I error control without actual permutation and is powerful under many settings in comparison to competing approaches. Pathway analysis of data from a study of F344 vs. HIV1Tg rats, and of sex differences in lymphoblastoid cell lines from humans, strongly supports the biological interpretability of the findings.

SUBMITTER: Zhou YH 

PROVIDER: S-EPMC4992401 | biostudies-literature | 2016 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Pathway analysis for RNA-Seq data using a score-based approach.

Zhou Yi-Hui YH  

Biometrics 20150810 1


A variety of pathway/gene-set approaches have been proposed to provide evidence of higher-level biological phenomena in the association of expression with experimental condition or clinical outcome. Among these approaches, it has been repeatedly shown that resampling methods are far preferable to approaches that implicitly assume independence of genes. However, few approaches have been optimized for the specific characteristics of RNA-Seq transcription data, in which mapped tags produce discrete  ...[more]

Similar Datasets

| S-EPMC6016759 | biostudies-literature
| S-EPMC4304217 | biostudies-other
| S-EPMC4625615 | biostudies-literature
| S-EPMC8527798 | biostudies-literature
| S-EPMC5513449 | biostudies-other
| S-EPMC3650863 | biostudies-literature
| S-EPMC4924883 | biostudies-literature
| S-EPMC4674845 | biostudies-literature
| S-EPMC5862256 | biostudies-literature
| S-EPMC5267345 | biostudies-literature