Unknown

Dataset Information

0

Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles


ABSTRACT:

Background

This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients.

Methods

Four datasets that contained UC and control samples were obtained from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) with consistency were screened via the MetaDE method. The weighted gene coexpression network (WGCNA) was used to distinguish significant modules based on the four datasets. The protein–protein interaction network was established based on intersection genes. Enrichment analysis of Gene Ontology (GO) biological processes (BPs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were established based on DAVID. An SVM combined with recursive feature elimination was also applied to construct a disease classifier for the disease diagnosis of UC patients. The efficacy of the SVM classifier was evaluated through receiver operating characteristic curves.

Results

Twelve highly preserved modules were obtained using the WGCNA, and 2009 DEGs with significant consistency were selected using the MetaDE method. Sixteen significantly related GO BPs and 12 KEGG pathways were obtained, such as cytokine-cytokine receptor interaction, cell adhesion molecules, and leukocyte transendothelial migration. Subsequently, 41 genes were used to construct an SVM classifier, such as CXCL1, CCR2, IL1B, and IL1A. The area under the curve (AUC) was 0.999 in the training dataset, whereas the AUC was 0.886, 0.790, and 0.819 in the validation set (GSE65114, GSE37283, and GSE36807, respectively).

Conclusions

An SVM classifier based on feature genes might correctly identify healthy people or UC patients.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12876-021-01940-0.

SUBMITTER: Han Y 

PROVIDER: S-EPMC8556884 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC10835364 | biostudies-literature
| S-EPMC7711060 | biostudies-literature
| S-EPMC7587454 | biostudies-literature
| S-EPMC6554330 | biostudies-literature
| S-EPMC3026350 | biostudies-literature
| S-EPMC5931983 | biostudies-literature
| S-EPMC9685902 | biostudies-literature
| S-EPMC8578892 | biostudies-literature
| S-EPMC4883582 | biostudies-literature
| S-EPMC5445628 | biostudies-literature