首页> 外文期刊>mSystems >Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
【24h】

Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data

机译:通过基因组数据的机器学习分析预测Klebsiella肺炎的表型多酶抗性

获取原文
           

摘要

Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from 600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P?=? 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P?=? 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics.
机译:多肾上腺素用作革兰氏阴性细菌感染的最后手段的治疗方法。它们的使用增加导致了对新出现的多粘素抗性(PR)的担忧。表型多粘素易感性测试是资源密集型且难以准确地执行。 PR的复杂多种子质性质和我们对其遗传基础的不完全理解使得难以使用抗性决定簇的检测来预测PR。因此,我们将机器学习(ML)应用于来自> 600克利布拉肺炎克隆克隆组258(CG258)基因组的全基因组测序数据,以预测表型PR。使用基于基于基于基于基于基于基于基于的基于基于ML的基于规则的方法,该方法是检测已知的PR基因中的变体(接收器 - 操作符曲线[Auroc]下的区域,0.894与0.791,p≤x≤0.006)。我们注意到通过使用细菌基因组 - 宽的关联研究来筛选相关基因组特征,并通过以先前的多粘菌素暴露的形式集成临床资料来增加性能。相反,作为K-MERS的基因组数据的无基因表示与性能降低有关(Auroc,0.692与0.894,p?= 0.015)。当解释ML模型提取基因组特征时,通过未提前编程的模型正确鉴定七种已知的PR基因中的6种,并且鉴定了涉及应力响应和维持细胞膜的几种基因作为Pr的潜在新型决定簇。这些发现是概念证据,即全基因组测序数据可以准确地预测K.肺炎肺肺肺痘痘CG258,并且可以应用于其他形式的复杂抗微生物抗性。重要性多肾上腺素是用于治疗高度抗性革兰氏阴性细菌的最后一个抗生素。多种多辛抗性的报道越来越多,提高了Postantibiotic时代的担忧。因此,聚辛抗性是具有重要的公共卫生威胁,但目前的检测表型方法难以执行。已经越来越努力使用全基因组测序来检测抗生素抗性,但由于其复杂的多基因性质,这一直难以适用于多脂素抗性。我们的研究的重要性是,我们成功地应用了机器学习方法,以预测Klebsiella肺炎克隆群258中的多辛抗性,一种常见的医疗保健相关和多药物抗性病原体。我们的研究结果强调,即使以复杂的抗生素耐药性,也可以成功地应用机器学习,并且代表了对可用于预测其他细菌和其他抗生素的抗性的重要贡献。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号