Unknown

Dataset Information

0

Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates.


ABSTRACT: Deregulated pathways identified from transcriptome data of two sample groups have played a key role in many genomic studies. Gene-set enrichment analysis (GSEA) has been commonly used for pathway or functional analysis of microarray data, and it is also being applied to RNA-seq data. However, most RNA-seq data so far have only small replicates. This enforces to apply the gene-permuting GSEA method (or preranked GSEA) which results in a great number of false positives due to the inter-gene correlation in each gene-set. We demonstrate that incorporating the absolute gene statistic in one-tailed GSEA considerably improves the false-positive control and the overall discriminatory ability of the gene-permuting GSEA methods for RNA-seq data. To test the performance, a simulation method to generate correlated read counts within a gene-set was newly developed, and a dozen of currently available RNA-seq enrichment analysis methods were compared, where the proposed methods outperformed others that do not account for the inter-gene correlation. Analysis of real RNA-seq data also supported the proposed methods in terms of false positive control, ranks of true positives and biological relevance. An efficient R package (AbsFilterGSEA) coded with C++ (Rcpp) is available from CRAN.

SUBMITTER: Yoon S 

PROVIDER: S-EPMC5102490 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates.

Yoon Sora S   Kim Seon-Young SY   Nam Dougu D  

PloS one 20161109 11


Deregulated pathways identified from transcriptome data of two sample groups have played a key role in many genomic studies. Gene-set enrichment analysis (GSEA) has been commonly used for pathway or functional analysis of microarray data, and it is also being applied to RNA-seq data. However, most RNA-seq data so far have only small replicates. This enforces to apply the gene-permuting GSEA method (or preranked GSEA) which results in a great number of false positives due to the inter-gene correl  ...[more]

Similar Datasets

| S-EPMC3622641 | biostudies-literature
| S-EPMC4117744 | biostudies-literature
| S-EPMC4248812 | biostudies-literature
| S-EPMC3618321 | biostudies-literature
| S-EPMC4265362 | biostudies-literature
| S-EPMC4161965 | biostudies-literature
| S-EPMC4870397 | biostudies-literature
| S-EPMC6404334 | biostudies-literature