Unknown

Dataset Information

0

Reliable multi-label learning via conformal predictor and random forest for syndrome differentiation of chronic fatigue in traditional Chinese medicine.


ABSTRACT:

Objective

Chronic Fatigue (CF) still remains unclear about its etiology, pathophysiology, nomenclature and diagnostic criteria in the medical community. Traditional Chinese medicine (TCM) adopts a unique diagnostic method, namely 'bian zheng lun zhi' or syndrome differentiation, to diagnose the CF with a set of syndrome factors, which can be regarded as the Multi-Label Learning (MLL) problem in the machine learning literature. To obtain an effective and reliable diagnostic tool, we use Conformal Predictor (CP), Random Forest (RF) and Problem Transformation method (PT) for the syndrome differentiation of CF.

Methods and materials

In this work, using PT method, CP-RF is extended to handle MLL problem. CP-RF applies RF to measure the confidence level (p-value) of each label being the true label, and then selects multiple labels whose p-values are larger than the pre-defined significance level as the region prediction. In this paper, we compare the proposed CP-RF with typical CP-NBC(Naïve Bayes Classifier), CP-KNN(K-Nearest Neighbors) and ML-KNN on CF dataset, which consists of 736 cases. Specifically, 95 symptoms are used to identify CF, and four syndrome factors are employed in the syndrome differentiation, including 'spleen deficiency', 'heart deficiency', 'liver stagnation' and 'qi deficiency'.

The results

CP-RF demonstrates an outstanding performance beyond CP-NBC, CP-KNN and ML-KNN under the general metrics of subset accuracy, hamming loss, one-error, coverage, ranking loss and average precision. Furthermore, the performance of CP-RF remains steady at the large scale of confidence levels from 80% to 100%, which indicates its robustness to the threshold determination. In addition, the confidence evaluation provided by CP is valid and well-calibrated.

Conclusion

CP-RF not only offers outstanding performance but also provides valid confidence evaluation for the CF syndrome differentiation. It would be well applicable to TCM practitioners and facilitate the utilities of objective, effective and reliable computer-based diagnosis tool.

SUBMITTER: Wang H 

PROVIDER: S-EPMC4053362 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reliable multi-label learning via conformal predictor and random forest for syndrome differentiation of chronic fatigue in traditional Chinese medicine.

Wang Huazhen H   Liu Xin X   Lv Bing B   Yang Fan F   Hong Yanzhu Y  

PloS one 20140611 6


<h4>Objective</h4>Chronic Fatigue (CF) still remains unclear about its etiology, pathophysiology, nomenclature and diagnostic criteria in the medical community. Traditional Chinese medicine (TCM) adopts a unique diagnostic method, namely 'bian zheng lun zhi' or syndrome differentiation, to diagnose the CF with a set of syndrome factors, which can be regarded as the Multi-Label Learning (MLL) problem in the machine learning literature. To obtain an effective and reliable diagnostic tool, we use C  ...[more]

Similar Datasets

| PRJNA796441 | ENA
| S-EPMC3958672 | biostudies-other
2017-03-26 | GSE85871 | GEO
| PRJNA556807 | ENA
| S-EPMC7281804 | biostudies-literature
| S-EPMC7817705 | biostudies-literature
| S-EPMC5363012 | biostudies-literature
| S-EPMC7054385 | biostudies-literature
| S-EPMC4955772 | biostudies-literature
2019-11-22 | GSE140769 | GEO