Unknown

Dataset Information

0

Identification of Proteins of Tobacco Mosaic Virus by Using a Method of Feature Extraction.


ABSTRACT: Tobacco mosaic virus, TMV for short, is widely distributed in the global tobacco industry and has a significant impact on tobacco production. It can reduce the amount of tobacco grown by 50-70%. In this research of study, we aimed to identify tobacco mosaic virus proteins and healthy tobacco leaf proteins by using machine learning approaches. The experiment's results showed that the support vector machine algorithm achieved high accuracy in different feature extraction methods. And 188-dimensions feature extraction method improved the classification accuracy. In that the support vector machine algorithm and 188-dimensions feature extraction method were finally selected as the final experimental methods. In the 10-fold cross-validation processes, the SVM combined with 188-dimensions achieved 93.5% accuracy on the training set and 92.7% accuracy on the independent validation set. Besides, the evaluation index of the results of experiments indicate that the method developed by us is valid and robust.

SUBMITTER: Chen YM 

PROVIDER: S-EPMC7581905 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification of Proteins of Tobacco Mosaic Virus by Using a Method of Feature Extraction.

Chen Yu-Miao YM   Zu Xin-Ping XP   Li Dan D  

Frontiers in genetics 20201009


Tobacco mosaic virus, TMV for short, is widely distributed in the global tobacco industry and has a significant impact on tobacco production. It can reduce the amount of tobacco grown by 50-70%. In this research of study, we aimed to identify tobacco mosaic virus proteins and healthy tobacco leaf proteins by using machine learning approaches. The experiment's results showed that the support vector machine algorithm achieved high accuracy in different feature extraction methods. And 188-dimension  ...[more]

Similar Datasets

| PRJNA665939 | ENA
| S-EPMC4426350 | biostudies-literature
| PRJNA924249 | ENA
| S-EPMC6988427 | biostudies-literature
| S-EPMC6006778 | biostudies-literature
| S-EPMC5353609 | biostudies-literature
| S-EPMC6983983 | biostudies-literature
| S-EPMC6523586 | biostudies-literature
| S-EPMC4628775 | biostudies-literature
| S-EPMC3560112 | biostudies-literature