Dataset Information

A comparison of classification methods for predicting Chronic Fatigue Syndrome based on genetic data.


ABSTRACT: BACKGROUND: In genomic association studies of disease susceptibility, it is essential to select a small number of genes that are more significant than the others. In this work, our goal was to compare computational tools with and without feature selection for predicting chronic fatigue syndrome (CFS) from genetic factors such as single nucleotide polymorphisms (SNPs). METHODS: We used the dataset from a previous study by the CDC Chronic Fatigue Syndrome Research Group. To uncover relationships between CFS and SNPs, we applied three classification algorithms: naive Bayes, the support vector machine, and the C4.5 decision tree. We also used two feature selection methods to identify a subset of influential SNPs: a hybrid approach combining the chi-squared and information-gain methods, and a wrapper-based method. RESULTS: Among the predictive models, naive Bayes with wrapper-based feature selection performed best at inferring disease susceptibility given the complex relationship between CFS and SNPs. CONCLUSION: We demonstrated that our approach is a promising method for assessing associations between CFS and SNPs.
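
The hybrid filter named in the abstract can be illustrated with a minimal sketch. The abstract does not say how the chi-squared and information-gain scores were combined, so the rule used here (keep SNPs that rank in the top k under both scores) is an assumption, as are the function names, the SNP names, and the toy genotype data. It is not the authors' implementation.

```python
import math
from collections import Counter


def chi_squared(feature, labels):
    """Pearson chi-squared statistic for a categorical feature vs. class labels."""
    n = len(labels)
    f_counts = Counter(feature)
    l_counts = Counter(labels)
    joint = Counter(zip(feature, labels))
    stat = 0.0
    for f, fc in f_counts.items():
        for l, lc in l_counts.items():
            expected = fc * lc / n
            observed = joint.get((f, l), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def information_gain(feature, labels):
    """Reduction in label entropy after splitting on the feature's values."""
    n = len(labels)
    conditional = 0.0
    for f, fc in Counter(feature).items():
        subset = [l for x, l in zip(feature, labels) if x == f]
        conditional += (fc / n) * entropy(subset)
    return entropy(labels) - conditional


def hybrid_select(snps, labels, k):
    """Keep SNPs ranked in the top k by BOTH scores (assumed combination rule)."""
    def top_k(score):
        ranked = sorted(snps, key=lambda name: score(snps[name], labels), reverse=True)
        return set(ranked[:k])
    return sorted(top_k(chi_squared) & top_k(information_gain))


# Hypothetical toy data: genotypes coded 0/1/2, labels 1 = CFS, 0 = control.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
snps = {
    "snp_a": [2, 2, 1, 2, 0, 0, 1, 0],  # strongly associated with the label
    "snp_b": [0, 1, 2, 0, 0, 1, 2, 0],  # same distribution in both classes
    "snp_c": [1, 1, 1, 0, 0, 0, 0, 1],  # weakly associated with the label
}
selected = hybrid_select(snps, labels, k=2)
print(selected)
```

A filter like this scores each SNP independently of any classifier. The wrapper-based method mentioned in the abstract differs in that it evaluates candidate SNP subsets by the accuracy of the classifier trained on them.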

SUBMITTER: Huang LC 

PROVIDER: S-EPMC2765429 | biostudies-literature | 2009

REPOSITORIES: biostudies-literature

Publications

A comparison of classification methods for predicting Chronic Fatigue Syndrome based on genetic data.

Huang Lung-Cheng, Hsu Sen-Yen, Lin Eugene

Journal of Translational Medicine, 2009-09-22


Similar Datasets

| S-EPMC8012483 | biostudies-literature
2020-07-07 | PXD016622 | Pride
2009-01-27 | GSE14577 | GEO
| S-EPMC4982549 | biostudies-literature
| S-EPMC6055066 | biostudies-literature
2014-08-12 | GSE59489 | GEO
2014-08-12 | E-GEOD-59489 | biostudies-arrayexpress
| S-EPMC5443514 | biostudies-literature
| S-EPMC7037777 | biostudies-literature
| S-EPMC10471690 | biostudies-literature