Unknown

Dataset Information

0

Generalized gene co-expression analysis via subspace clustering using low-rank representation.


ABSTRACT: BACKGROUND:Gene Co-expression Network Analysis (GCNA) helps identify gene modules with potential biological functions and has become a popular method in bioinformatics and biomedical research. However, most current GCNA algorithms use correlation to build gene co-expression networks and identify modules with highly correlated genes. There is a need to look beyond correlation and identify gene modules using other similarity measures for finding novel biologically meaningful modules. RESULTS:We propose a new generalized gene co-expression analysis algorithm via subspace clustering that can identify biologically meaningful gene co-expression modules with genes that are not all highly correlated. We use low-rank representation to construct gene co-expression networks and local maximal quasi-clique merger to identify gene co-expression modules. We applied our method on three large microarray datasets and a single-cell RNA sequencing dataset. We demonstrate that our method can identify gene modules with different biological functions than current GCNA methods and find gene modules with prognostic values. CONCLUSIONS:The presented method takes advantage of subspace clustering to generate gene co-expression networks rather than using correlation as the similarity measure between genes. Our generalized GCNA method can provide new insights from gene expression datasets and serve as a complement to current GCNA algorithms.

SUBMITTER: Wang T 

PROVIDER: S-EPMC6509871 | biostudies-literature | 2019 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Generalized gene co-expression analysis via subspace clustering using low-rank representation.

Wang Tongxin T   Zhang Jie J   Huang Kun K  

BMC bioinformatics 20190501 Suppl 7


<h4>Background</h4>Gene Co-expression Network Analysis (GCNA) helps identify gene modules with potential biological functions and has become a popular method in bioinformatics and biomedical research. However, most current GCNA algorithms use correlation to build gene co-expression networks and identify modules with highly correlated genes. There is a need to look beyond correlation and identify gene modules using other similarity measures for finding novel biologically meaningful modules.<h4>Re  ...[more]

Similar Datasets

| S-EPMC5986621 | biostudies-literature
| S-EPMC3602020 | biostudies-literature
| S-EPMC6504107 | biostudies-literature
| S-EPMC5441581 | biostudies-literature
| S-EPMC7611820 | biostudies-literature
| S-EPMC7297185 | biostudies-literature
| S-EPMC5641478 | biostudies-literature
| S-EPMC5539778 | biostudies-other
| S-EPMC6364613 | biostudies-literature
| S-EPMC5953310 | biostudies-literature