
Dataset Information


Artificial Intelligence and Machine learning based prediction of resistant and susceptible mutations in Mycobacterium tuberculosis.


ABSTRACT: Tuberculosis (TB), an infectious disease caused by Mycobacterium tuberculosis (M.tb), causes the highest number of deaths globally of any bacterial disease, necessitating novel diagnosis and treatment strategies. High-throughput sequencing methods generate a large amount of data that could be exploited to identify mutations associated with multidrug-resistant TB (MDR-TB). The present work is a computational framework that uses artificial intelligence (AI) based machine learning (ML) approaches to predict resistance in the genes rpoB, inhA, katG, pncA, gyrA and gyrB for the drugs rifampicin, isoniazid, pyrazinamide and fluoroquinolones. The single nucleotide variations were represented by several sequence and structural features that indicate the influence of mutations on the target protein coded by each gene. We used four ML algorithms (naïve Bayes, k-nearest neighbor, support vector machine, and artificial neural network) to build the prediction models. The classification models had an average accuracy of 85% across all examined genes and were evaluated on an external unseen dataset to demonstrate their application. Further, molecular docking and molecular dynamics simulations were performed on complexes of anti-TB drugs with the wild-type proteins and with the predicted resistance-causing mutant proteins, to study the impact of the mutations on protein conformation and to confirm the observed phenotype.
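
To make the classification step concrete, the following is a minimal sketch of one of the four named algorithm families (k-nearest neighbor) applied to mutation feature vectors. It is not the authors' implementation. The feature names, the toy values, and the helper names `knn_predict` and `leave_one_out_accuracy` are all assumptions made for illustration; the record does not list the actual features or hyperparameters used in the study.

```python
import math
from collections import Counter

# Hypothetical toy data, NOT taken from the study. Each mutation is described
# by three made-up, normalized features:
# (hydrophobicity change, side-chain volume change, distance to drug-binding site)
TRAIN = [
    ((0.90, 0.80, 0.10), "resistant"),
    ((0.80, 0.70, 0.20), "resistant"),
    ((0.70, 0.90, 0.15), "resistant"),
    ((0.10, 0.20, 0.90), "susceptible"),
    ((0.20, 0.10, 0.80), "susceptible"),
    ((0.15, 0.30, 0.85), "susceptible"),
]


def knn_predict(train, query, k=3):
    """Return the majority label among the k training points nearest to query."""
    # Rank training examples by Euclidean distance to the query vector.
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def leave_one_out_accuracy(data, k=3):
    """Fraction of examples classified correctly when each is held out in turn."""
    correct = 0
    for i, (features, label) in enumerate(data):
        rest = data[:i] + data[i + 1:]  # every example except the held-out one
        if knn_predict(rest, features, k) == label:
            correct += 1
    return correct / len(data)


if __name__ == "__main__":
    print(knn_predict(TRAIN, (0.85, 0.75, 0.10)))
    print(leave_one_out_accuracy(TRAIN))
```

Holding out data during evaluation mirrors, in miniature, the abstract's point that the models were assessed on an external unseen dataset rather than on the examples they were trained on.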

SUBMITTER: Jamal S 

PROVIDER: S-EPMC7099008 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature


Publications

Artificial Intelligence and Machine learning based prediction of resistant and susceptible mutations in Mycobacterium tuberculosis.

Salma Jamal, Mohd Khubaib, Rishabh Gangwar, Sonam Grover, Abhinav Grover, Seyed E. Hasnain

Scientific Reports, 2020 Mar 26, issue 1



Similar Datasets

| S-EPMC8786279 | biostudies-literature
| S-EPMC8141697 | biostudies-literature
| S-EPMC8701097 | biostudies-literature
| S-EPMC9613681 | biostudies-literature
| S-EPMC8467682 | biostudies-literature
| S-EPMC9933281 | biostudies-literature
| S-EPMC8016865 | biostudies-literature
| S-EPMC9035975 | biostudies-literature
| S-EPMC7083992 | biostudies-literature
| S-EPMC7808396 | biostudies-literature