Dataset Information

An Improved Deep Learning Model: S-TextBLCNN for Traditional Chinese Medicine Formula Classification.

ABSTRACT: Purpose: This study proposes an S-TextBLCNN model for the efficacy of traditional Chinese medicine (TCM) formula classification. This model uses deep learning to analyze the relationship between herb efficacy and formula efficacy, which is helpful in further exploring the internal rules of formula combination. Methods: First, for the TCM herbs extracted from Chinese Pharmacopoeia, natural language processing (NLP) is used to learn and realize the quantitative expression of different TCM herbs. Three features of herb name, herb properties, and herb efficacy are selected to encode herbs and to construct formula-vector and herb-vector. Then, based on 2,664 formulae for stroke collected in TCM literature and 19 formula efficacy categories extracted from Yifang Jijie, an improved deep learning model TextBLCNN consists of a bidirectional long short-term memory (Bi-LSTM) neural network and a convolutional neural network (CNN) is proposed. Based on 19 formula efficacy categories, binary classifiers are established to classify the TCM formulae. Finally, aiming at the imbalance problem of formula data, the over-sampling method SMOTE is used to solve it and the S-TextBLCNN model is proposed. Results: The formula-vector composed of herb efficacy has the best effect on the classification model, so it can be inferred that there is a strong relationship between herb efficacy and formula efficacy. The TextBLCNN model has an accuracy of 0.858 and an F₁-score of 0.762, both higher than the logistic regression (acc = 0.561, F₁-score = 0.567), SVM (acc = 0.703, F₁-score = 0.591), LSTM (acc = 0.723, F₁-score = 0.621), and TextCNN (acc = 0.745, F₁-score = 0.644) models. In addition, the over-sampling method SMOTE is used in our model to tackle data imbalance, and the F₁-score is greatly improved by an average of 47.1% in 19 models. Conclusion: The combination of formula feature representation and the S-TextBLCNN model improve the accuracy in formula efficacy classification. It provides a new research idea for the study of TCM formula compatibility.

SUBMITTER: Cheng N

PROVIDER: S-EPMC8727750 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

An Improved Deep Learning Model: S-TextBLCNN for Traditional Chinese Medicine Formula Classification.

Cheng Ning N Chen Yue Y Gao Wanqing W Liu Jiajun J Huang Qunfu Q Yan Cheng C Huang Xindi X Ding Changsong C

Frontiers in genetics 20211222

Purpose: This study proposes an S-TextBLCNN model for the efficacy of traditional Chinese medicine (TCM) formula classification. This model uses deep learning to analyze the relationship between herb efficacy and formula efficacy, which is helpful in further exploring the internal rules of formula combination. Methods: First, for the TCM herbs extracted from Chinese Pharmacopoeia, natural language processing (NLP) is used to learn and realize the quantitative expression of d ...[more]

PMID: 35003231

Dataset Information

An Improved Deep Learning Model: S-TextBLCNN for Traditional Chinese Medicine Formula Classification.

Publications

An Improved Deep Learning Model: S-TextBLCNN for Traditional Chinese Medicine Formula Classification.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A novel HbA1c-lowering traditional Chinese medicinal formula, identified by translational medicine study
2016-12-05 | GSE53119 | GEO

Image recognition of traditional Chinese medicine based on deep learning.
| S-EPMC10402920 | biostudies-literature

Expression date from db/db mice with or without treatment of traditional Chinese medicine Tang-shen Formula
2017-01-25 | GSE90842 | GEO

Phytochemistry, Pharmacology and Quality Control of Xiasangju: A Traditional Chinese Medicine Formula.
| S-EPMC9259862 | biostudies-literature

Determining diabetic kidney disease severity using traditional Chinese medicine syndrome classification.
| S-EPMC11479003 | biostudies-literature

dTGS: Method for Effective Components Identification from Traditional Chinese Medicine Formula and Mechanism Analysis.
| S-EPMC3878852 | biostudies-literature

Smart Soup, a traditional Chinese medicine formula, ameliorates amyloid pathology and related cognitive deficits.
| S-EPMC4227681 | biostudies-literature

Predicting Meridian in Chinese traditional medicine using machine learning approaches.
| S-EPMC6876772 | biostudies-literature

Machine learning-assisted rapid determination for traditional Chinese Medicine Constitution.
| S-EPMC11403957 | biostudies-literature