Unknown

Dataset Information

0

GSEApy: a comprehensive package for performing gene set enrichment analysis in Python.


ABSTRACT:

Motivation

Gene set enrichment analysis (GSEA) is a commonly used algorithm for characterizing gene expression changes. However, the currently available tools used to perform GSEA have a limited ability to analyze large datasets, which is particularly problematic for the analysis of single-cell data. To overcome this limitation, we developed a GSEA package in Python (GSEApy), which could efficiently analyze large single-cell datasets.

Results

We present a package (GSEApy) that performs GSEA in either the command line or Python environment. GSEApy uses a Rust implementation to enable it to calculate the same enrichment statistic as GSEA for a collection of pathways. The Rust implementation of GSEApy is 3-fold faster than the Numpy version of GSEApy (v0.10.8) and uses >4-fold less memory. GSEApy also provides an interface between Python and Enrichr web services, as well as for BioMart. The Enrichr application programming interface enables GSEApy to perform over-representation analysis for an input gene list. Furthermore, GSEApy consists of several tools, each designed to facilitate a particular type of enrichment analysis.

Availability and implementation

The new GSEApy with Rust extension is deposited in PyPI: https://pypi.org/project/gseapy/. The GSEApy source code is freely available at https://github.com/zqfang/GSEApy. Also, the documentation website is available at https://gseapy.rtfd.io/.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Fang Z 

PROVIDER: S-EPMC9805564 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

GSEApy: a comprehensive package for performing gene set enrichment analysis in Python.

Fang Zhuoqing Z   Liu Xinyuan X   Peltz Gary G  

Bioinformatics (Oxford, England) 20230101 1


<h4>Motivation</h4>Gene set enrichment analysis (GSEA) is a commonly used algorithm for characterizing gene expression changes. However, the currently available tools used to perform GSEA have a limited ability to analyze large datasets, which is particularly problematic for the analysis of single-cell data. To overcome this limitation, we developed a GSEA package in Python (GSEApy), which could efficiently analyze large single-cell datasets.<h4>Results</h4>We present a package (GSEApy) that per  ...[more]

Similar Datasets

| S-EPMC4987924 | biostudies-literature
| S-EPMC5860183 | biostudies-other
| S-EPMC9252808 | biostudies-literature
| S-EPMC6404334 | biostudies-literature
| S-EPMC3505158 | biostudies-literature
| S-EPMC5570149 | biostudies-literature
| S-EPMC3436816 | biostudies-other
| S-EPMC1933132 | biostudies-literature
| S-EPMC1183189 | biostudies-literature
| S-EPMC6421703 | biostudies-literature