Dataset Information

A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms.


ABSTRACT:

BACKGROUND: An in silico vaccine discovery pipeline for eukaryotic pathogens typically consists of several computational tools that predict protein characteristics. The aim of the in silico approach to discovering subunit vaccines is to use these predicted characteristics to identify proteins that are worthy of laboratory investigation. A major challenge is that the predictions inherently contain hidden inaccuracies and contradictions. This study focuses on how to reduce the number of false candidates using machine learning algorithms rather than relying on expensive laboratory validation. Proteins from Toxoplasma gondii, Plasmodium sp., and Caenorhabditis elegans were used as training and test datasets.

RESULTS: The results show that machine learning algorithms can effectively distinguish expected true from expected false vaccine candidates (with an average sensitivity and specificity of 0.97 and 0.98, respectively) for proteins observed to induce immune responses experimentally.

CONCLUSIONS: Vaccine candidates from an in silico approach can only be truly validated in a laboratory. Given any in silico output and appropriate training data, the number of false candidates allocated for validation can be dramatically reduced using a pool of machine learning algorithms. This will ultimately save time and money in the laboratory.
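As a rough illustration of the two metrics quoted in the abstract, the sketch below computes sensitivity and specificity for a pool of classifiers whose labels are combined per protein. The abstract does not say how the pool's outputs are combined, so the majority vote used here is an assumption, and the function names (`majority_vote`, `sensitivity_specificity`) and the toy labels are hypothetical and not taken from the study.

```python
from collections import Counter


def majority_vote(pool_predictions):
    """Combine per-classifier label lists into one label per protein.

    pool_predictions: list of equal-length lists, one list per classifier.
    Assumption: a simple majority vote; the source does not specify the rule.
    """
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*pool_predictions)]


def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity) for binary labels.

    sensitivity = TP / (TP + FN), specificity = TN / (TN + FP).
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative example")
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Toy data only: 1 = expected true candidate, 0 = expected false candidate.
    y_true = [1, 1, 1, 0, 0, 0]
    pool = [
        [1, 1, 0, 0, 0, 1],  # hypothetical classifier A
        [1, 1, 1, 0, 1, 0],  # hypothetical classifier B
        [1, 0, 1, 0, 0, 0],  # hypothetical classifier C
    ]
    combined = majority_vote(pool)
    print(sensitivity_specificity(y_true, combined))
```

Using an odd number of classifiers avoids tied votes in the binary case; any other combination rule could be substituted for `majority_vote` without changing the metric calculation.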

SUBMITTER: Goodswen SJ 

PROVIDER: S-EPMC3826511 | biostudies-literature | 2013 Nov

REPOSITORIES: biostudies-literature

Publications

A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms.

Goodswen SJ, Kennedy PJ, Ellis JT

BMC Bioinformatics, 2013-11-02



Similar Datasets

S-EPMC4207429 | biostudies-other
S-EPMC8923015 | biostudies-literature
S-EPMC4278717 | biostudies-literature
S-EPMC7046186 | biostudies-literature
S-EPMC4263066 | biostudies-literature
S-EPMC3813700 | biostudies-literature
S-EPMC7340908 | biostudies-literature
S-EPMC5289882 | biostudies-literature
S-EPMC8237624 | biostudies-literature
S-EPMC6279870 | biostudies-literature