Unknown

Dataset Information

0

A biochemically-interpretable machine learning classifier for microbial GWAS.


ABSTRACT: Current machine learning classifiers have successfully been applied to whole-genome sequencing data to identify genetic determinants of antimicrobial resistance (AMR), but they lack causal interpretation. Here we present a metabolic model-based machine learning classifier, named Metabolic Allele Classifier (MAC), that uses flux balance analysis to estimate the biochemical effects of alleles. We apply the MAC to a dataset of 1595 drug-tested Mycobacterium tuberculosis strains and show that MACs predict AMR phenotypes with accuracy on par with mechanism-agnostic machine learning models (isoniazid AUC?=?0.93) while enabling a biochemical interpretation of the genotype-phenotype map. Interpretation of MACs for three antibiotics (pyrazinamide, para-aminosalicylic acid, and isoniazid) recapitulates known AMR mechanisms and suggest a biochemical basis for how the identified alleles cause AMR. Extending flux balance analysis to identify accurate sequence classifiers thus contributes mechanistic insights to GWAS, a field thus far dominated by mechanism-agnostic results.

SUBMITTER: Kavvas ES 

PROVIDER: S-EPMC7244534 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

A biochemically-interpretable machine learning classifier for microbial GWAS.

Kavvas Erol S ES   Yang Laurence L   Monk Jonathan M JM   Heckmann David D   Palsson Bernhard O BO  

Nature communications 20200522 1


Current machine learning classifiers have successfully been applied to whole-genome sequencing data to identify genetic determinants of antimicrobial resistance (AMR), but they lack causal interpretation. Here we present a metabolic model-based machine learning classifier, named Metabolic Allele Classifier (MAC), that uses flux balance analysis to estimate the biochemical effects of alleles. We apply the MAC to a dataset of 1595 drug-tested Mycobacterium tuberculosis strains and show that MACs p  ...[more]

Similar Datasets

| S-EPMC11308548 | biostudies-literature
| S-EPMC7937228 | biostudies-literature
2024-01-27 | GSE230012 | GEO
2024-01-28 | GSE230011 | GEO
2024-01-28 | GSE230010 | GEO
2024-01-28 | GSE230007 | GEO
2024-01-28 | GSE230006 | GEO
| S-EPMC6825274 | biostudies-literature
| S-EPMC5553725 | biostudies-other
| S-EPMC5981198 | biostudies-literature