Unknown

Dataset Information

0

Mapping membrane activity in undiscovered peptide sequence space using machine learning.


ABSTRACT: There are some ?1,100 known antimicrobial peptides (AMPs), which permeabilize microbial membranes but have diverse sequences. Here, we develop a support vector machine (SVM)-based classifier to investigate ?-helical AMPs and the interrelated nature of their functional commonality and sequence homology. SVM is used to search the undiscovered peptide sequence space and identify Pareto-optimal candidates that simultaneously maximize the distance ? from the SVM hyperplane (thus maximize its "antimicrobialness") and its ?-helicity, but minimize mutational distance to known AMPs. By calibrating SVM machine learning results with killing assays and small-angle X-ray scattering (SAXS), we find that the SVM metric ? correlates not with a peptide's minimum inhibitory concentration (MIC), but rather its ability to generate negative Gaussian membrane curvature. This surprising result provides a topological basis for membrane activity common to AMPs. Moreover, we highlight an important distinction between the maximal recognizability of a sequence to a trained AMP classifier (its ability to generate membrane curvature) and its maximal antimicrobial efficacy. As mutational distances are increased from known AMPs, we find AMP-like sequences that are increasingly difficult for nature to discover via simple mutation. Using the sequence map as a discovery tool, we find a unexpectedly diverse taxonomy of sequences that are just as membrane-active as known AMPs, but with a broad range of primary functions distinct from AMP functions, including endogenous neuropeptides, viral fusion proteins, topogenic peptides, and amyloids. The SVM classifier is useful as a general detector of membrane activity in peptide sequences.

SUBMITTER: Lee EY 

PROVIDER: S-EPMC5137689 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mapping membrane activity in undiscovered peptide sequence space using machine learning.

Lee Ernest Y EY   Fulan Benjamin M BM   Wong Gerard C L GC   Ferguson Andrew L AL  

Proceedings of the National Academy of Sciences of the United States of America 20161114 48


There are some ∼1,100 known antimicrobial peptides (AMPs), which permeabilize microbial membranes but have diverse sequences. Here, we develop a support vector machine (SVM)-based classifier to investigate ⍺-helical AMPs and the interrelated nature of their functional commonality and sequence homology. SVM is used to search the undiscovered peptide sequence space and identify Pareto-optimal candidates that simultaneously maximize the distance σ from the SVM hyperplane (thus maximize its "antimic  ...[more]

Similar Datasets

2021-06-02 | GSE175942 | GEO
| S-EPMC6803653 | biostudies-literature
| S-EPMC8201876 | biostudies-literature
| S-EPMC7335209 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC7314597 | biostudies-literature
| S-EPMC7603480 | biostudies-literature
| PRJNA734333 | ENA
| S-EPMC3404069 | biostudies-literature
| S-EPMC4329842 | biostudies-literature