Dataset Information

Detecting ulcerative colitis from colon samples using efficient feature selection and machine learning.


ABSTRACT: Ulcerative colitis (UC) is one of the most common forms of inflammatory bowel disease (IBD) characterized by inflammation of the mucosal layer of the colon. Diagnosis of UC is based on clinical symptoms, and then confirmed based on endoscopic, histologic and laboratory findings. Feature selection and machine learning have been previously used for creating models to facilitate the diagnosis of certain diseases. In this work, we used a recently developed feature selection algorithm (DRPT) combined with a support vector machine (SVM) classifier to generate a model to discriminate between healthy subjects and subjects with UC based on the expression values of 32 genes in colon samples. We validated our model with an independent gene expression dataset of colonic samples from subjects in active and inactive periods of UC. Our model perfectly detected all active cases and had an average precision of 0.62 in the inactive cases. Compared with results reported in previous studies and a model generated by a recently published software for biomarker discovery using machine learning (BioDiscML), our final model for detecting UC shows better performance in terms of average precision.
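The abstract reports model quality as average precision. As a reading aid, here is a minimal sketch of the standard non-interpolated definition: the mean of the precision values taken at the rank of each true positive. This record does not say how the authors computed the metric, so the function name `average_precision`, the tie handling, and the toy data are assumptions for illustration only.

```python
def average_precision(labels, scores):
    """Non-interpolated average precision for binary labels (1 = positive).

    Assumption: tied scores keep their input order; library implementations
    may treat ties differently.
    """
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    # Rank samples from highest to lowest classifier score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    total_positives = sum(1 for _, label in ranked if label == 1)
    if total_positives == 0:
        raise ValueError("at least one positive label is required")
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            # Precision among the top `rank` samples, recorded at each positive.
            precision_sum += hits / rank
    return precision_sum / total_positives


# Hypothetical toy data, not taken from the study.
toy_labels = [1, 0, 1, 0, 1]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3]
toy_ap = average_precision(toy_labels, toy_scores)
```

A classifier that ranks every positive sample above every negative one scores 1.0 under this definition, which is one way to read "perfectly detected all active cases".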

SUBMITTER: Khorasani HM 

PROVIDER: S-EPMC7426912 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

Publications

Detecting ulcerative colitis from colon samples using efficient feature selection and machine learning.

Hanieh Marvi Khorasani, Hamid Usefi, Lourdes Peña-Castillo

Scientific Reports, 2020 Aug 13, issue 1


Similar Datasets

S-EPMC7038475 | biostudies-literature
S-EPMC6567636 | biostudies-literature
S-EPMC9556236 | biostudies-literature
S-EPMC6446290 | biostudies-literature
S-EPMC11307476 | biostudies-literature
S-EPMC10989691 | biostudies-literature
S-EPMC10009563 | biostudies-literature
S-EPMC10015309 | biostudies-literature
S-EPMC8286592 | biostudies-literature
S-EPMC9999590 | biostudies-literature