Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Predicting Antimicrobial Resistance Using Partial Genome Alignments

Journal Article · · mSystems
 [1];  [2];  [3];  [2];  [1];  [1];  [4]
  1. Technical Univ. of Denmark, Lyngby (Denmark)
  2. Univ. of Chicago, IL (United States). Consortium for Advanced Science and Engineering; Argonne National Lab. (ANL), Argonne, IL (United States)
  3. National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark
  4. Univ. of Chicago, IL (United States). Consortium for Advanced Science and Engineering; Argonne National Lab. (ANL), Argonne, IL (United States); Northwest Argonne Institute for Science and Engineering, Evanston, IL (United States)
Antimicrobial resistance (AMR) is an important global health threat that impacts millions of people worldwide each year. Developing methods that can detect and predict AMR phenotypes can help to mitigate the spread of AMR by informing clinical decision making and appropriate mitigation strategies. Many bioinformatic methods have been developed for predicting AMR phenotypes from whole-genome sequences and AMR genes, but recent studies have indicated that predictions can be made from incomplete genome sequence data. In order to more systematically understand this, we built random forest-based machine learning classifiers for predicting susceptible and resistant phenotypes for Klebsiella pneumoniae (1,640 strains), Mycobacterium tuberculosis (2,497 strains), and Salmonella enterica (1,981 strains). We started by building models from alignments that were based on a reference chromosome for each species. We then subsampled each chromosomal alignment and built models for the resulting subalignments, finding that very small regions, representing approximately 0.1 to 0.2% of the chromosome, are predictive. In K. pneumoniae, M. tuberculosis, and S. enterica, the subalignments are able to predict multiple AMR phenotypes with at least 70% accuracy, even though most do not encode an AMR-related function. We used these models to identify regions of the chromosome with high and low predictive signals. Finally, subalignments that retain high accuracy across larger phylogenetic distances were examined in greater detail, revealing genes and intergenic regions with potential links to AMR, virulence, transport, and survival under stress conditions. IMPORTANCE Antimicrobial resistance causes thousands of deaths annually worldwide. Understanding the regions of the genome that are involved in antimicrobial resistance is important for developing mitigation strategies and preventing transmission. Machine learning models are capable of predicting antimicrobial resistance phenotypes from bacterial genome sequence data by identifying resistance genes, mutations, and other correlated features. They are also capable of implicating regions of the genome that have not been previously characterized as being involved in resistance. In this study, we generated global chromosomal alignments for Klebsiella pneumoniae, Mycobacterium tuberculosis, and Salmonella enterica and systematically searched them for small conserved regions of the genome that enable the prediction of antimicrobial resistance phenotypes. In addition to known antimicrobial resistance genes, this analysis identified genes involved in virulence and transport functions, as well as many genes with no previous implication in antimicrobial resistance.
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
Defense Advanced Research Projects Agency; National Institutes of Health (NIH). National Institute of Allergy and Infectious Diseases (NIAID); Novo Nordisk Foundation
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1854526
Journal Information:
mSystems, Journal Name: mSystems Journal Issue: 3 Vol. 6; ISSN 2379-5077
Publisher:
American Society for MicrobiologyCopyright Statement
Country of Publication:
United States
Language:
English

References (56)

Selection of representative protein data sets journal March 1992
Efflux drug transporters at the forefront of antimicrobial resistance journal July 2017
Learning from imbalanced data: open challenges and future directions journal April 2016
Learning similarity measures from data journal October 2019
Iron/sulfur proteins biogenesis in prokaryotes: Formation, regulation and diversity journal March 2013
The rpoB gene as a tool for clinical microbiologists journal January 2009
Random Forests journal January 2001
Linkage disequilibrium — understanding the evolutionary past and mapping the medical future journal June 2008
Machine learning and structural analysis of Mycobacterium tuberculosis pan-genome identifies genetic signatures of antibiotic resistance journal October 2018
Genome-wide discovery of epistatic loci affecting antibiotic resistance in Neisseria gonorrhoeae using evolutionary couplings journal December 2018
Rapid inference of antibiotic resistance and susceptibility by genomic neighbour typing journal February 2020
Sequencing-based methods and resources to study antimicrobial resistance journal March 2019
Developing an in silico minimum inhibitory concentration panel test for Klebsiella pneumoniae journal January 2018
Integrating standardized whole genome sequence analysis with a global Mycobacterium tuberculosis antibiotic resistance knowledgebase journal October 2018
Interpretable genotype-to-phenotype classifiers with performance guarantees journal March 2019
Genomic profiling of antimicrobial resistance genes in clinical isolates of Salmonella Typhi from patients infected with Typhoid fever in India journal May 2020
Understanding and predicting ciprofloxacin minimum inhibitory concentration in Escherichia coli with machine learning journal September 2020
Antimicrobial Resistance Prediction in PATRIC and RAST journal June 2016
PATRIC as a unique resource for studying antimicrobial resistance journal July 2017
KMC 2: fast and resource-frugal k-mer counting journal January 2015
A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains journal June 2018
Collaborative Effort for a Centralized Worldwide Tuberculosis Relational Sequencing Data Platform: Figure 1. journal September 2015
Role of the DnaK and HscA homologs of Hsp70 chaperones in protein folding in E.coli journal August 1998
The Transporter Classification Database (TCDB): recent advances journal November 2015
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation journal November 2015
VFDB 2019: a comparative pathogenomic platform with an interactive web interface journal November 2018
Victors: a web-based knowledge base of virulence factors in human and animal pathogens journal October 2018
Interactive Tree Of Life (iTOL) v4: recent updates and new developments journal April 2019
CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database journal October 2019
The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities journal October 2019
Antibiotic export: transporters involved in the final step of natural product production journal August 2019
Small sample size effects in statistical pattern recognition: recommendations for practitioners journal March 1991
Predicting antimicrobial susceptibility from the bacterial genome: A new paradigm for one health resistance monitoring journal October 2020
Clinically relevant mutations in core metabolic genes confer antibiotic resistance journal February 2021
Validating the AMRFinder Tool and Resistance Gene Database by Using Antimicrobial Resistance Genotype-Phenotype Correlations in a Collection of Isolates journal August 2019
Nomenclature for Macrolide and Macrolide-Lincosamide-Streptogramin B Resistance Determinants journal December 1999
Horizontal Gene Transfer and Acquired Antibiotic Resistance in Salmonella enterica Serovar Heidelberg following In Vitro Incubation in Broiler Ceca journal November 2019
Adaptive and Mutational Resistance: Role of Porins and Efflux Pumps in Drug Resistance journal October 2012
Antimicrobial Resistance and Virulence: a Successful or Deleterious Association in the Bacterial World? journal April 2013
Cysteine Desulfurase IscS2 Plays a Role in Oxygen Resistance in Clostridium difficile journal August 2018
Using Machine Learning To Predict Antimicrobial MICs and Associated Genomic Features for Nontyphoidal Salmonella journal October 2018
Diagnostic Accuracy and Reproducibility of WHO-Endorsed Phenotypic Drug Susceptibility Testing Methods for First-Line and Second-Line Antituberculosis Drugs journal February 2013
Characterization of the katG and inhA genes of isoniazid-resistant clinical isolates of Mycobacterium tuberculosis journal November 1995
Molecular Genetic Analysis of Nucleotide Polymorphisms Associated with Ethambutol Resistance in Human Isolates of Mycobacterium tuberculosis journal February 2000
Effect of rpoB Mutations Conferring Rifampin Resistance on Fitness of Mycobacterium tuberculosis journal March 2004
Prediction of Acquired Antimicrobial Resistance for Multiple Bacterial Species Using Neural Networks journal January 2020
Presence of katG gene in resistant Mycobacterium tuberculosis. journal November 1996
Antimicrobial resistance: risk associated with antibiotic overuse and initiatives to reduce the problem journal September 2014
Drug-susceptibility testing in tuberculosis: methods and reliability of results journal March 2005
Rapid and precise alignment of raw reads against redundant databases with KMA journal August 2018
A machine learning-based service for estimating quality of genomes using PATRIC journal October 2019
Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data journal December 2018
VAMPr: VAriant Mapping and Prediction of antibiotic resistance via explainable features and machine learning journal January 2020
Machine learning with random subspace ensembles identifies antimicrobial resistance determinants from pan-genomes of three pathogens journal March 2020
DnaK as Antibiotic Target: Hot Spot Residues Analysis for Differential Inhibition of the Bacterial Protein in Comparison with the Human HSP70 journal April 2015
Epistasis and the Evolution of Antimicrobial Resistance journal February 2017

Similar Records

Predicting antimicrobial resistance using conserved genes
Journal Article · Mon Oct 19 00:00:00 EDT 2020 · PLoS Computational Biology (Online) · OSTI ID:1757970

Genotypic and Phenotypic Characterization of Incompatibility Group FIB Positive Salmonella enterica Serovar Typhimurium Isolates from Food Animal Sources
Journal Article · Tue Nov 03 23:00:00 EST 2020 · Genes · OSTI ID:1816813