Journal of the American Medical Informatics Association : JAMIA | 2021

An explainable machine learning platform for pyrazinamide resistance prediction and genetic feature identification of Mycobacterium tuberculosis

 
 
 

Abstract


OBJECTIVE\nTuberculosis is the leading cause of death from a single infectious agent. The emergence of antimicrobial resistant Mycobacterium tuberculosis strains makes the problem more severe. Pyrazinamide (PZA) is an important component for short-course treatment regimens and first- and second-line treatment regimens. This research aims for fast diagnosis of M. tuberculosis resistance to PZA and identification of genetic features causing resistance.\n\n\nMATERIALS AND METHODS\nWe use clinically collected genomic data of M. tuberculosis that are resistant or susceptible to PZA. A machine learning platform is built to diagnose PZA resistance using the whole genome sequence data, and to identify resistance genes and mutations. The platform consists of a deep convolutional neural network (DCNN) model for resistance diagnosis and a support vector machine (SVM) model as a surrogate to identify resistance genes and mutations.\n\n\nRESULTS\nThe DCNN model achieves a PZA resistance diagnosis accuracy of 93%. Each prediction takes less than a second. The SVM has revealed 2 novel genes, embB and gyrA, besides the well-known pncA gene, and 9 mutations that harbor PZA resistance.\n\n\nDISCUSSION\nThe DCNN and SVM machine learning platform, if used together with the real-time genome sequencing machines, could allow for rapid PZA diagnosis, allowing for critical time to ensure good patient outcomes, and preventing outbreaks of deadly infections. Furthermore, identifying pertinent resistance genes and mutations will help researchers better understand the biological mechanisms behind resistance.\n\n\nCONCLUSIONS\nMachine learning can be used to achieve high-accuracy resistance prediction, and identify genes and mutations causing the resistance.

Volume None
Pages None
DOI 10.1093/jamia/ocaa233
Language English
Journal Journal of the American Medical Informatics Association : JAMIA

Full Text