首页> 外文期刊>Journal of applied microbiology >Capreomycin resistance prediction in two species of iMycobacterium/i using a stacked ensemble method
【24h】

Capreomycin resistance prediction in two species of iMycobacterium/i using a stacked ensemble method

机译:Capreomycin resistance prediction in two species of iMycobacterium/i using a stacked ensemble method

获取原文
获取原文并翻译 | 示例
       

摘要

titleAbstract/titlesectiontitleAims/titlepPredicting bacterial resistance provides valuable information that can assist in clinical decisions. With recent advances in whole genome sequencing technology, the detection of antibiotic resistance (AR) proteins directly from genomic data is becoming feasible. AR genes/proteins can be identified using best‐hit methods that work by comparing candidate sequences with known AR genes in public databases. However, these approaches may fail to detect resistance genes with sequences that differ significantly from known sequences. Our goal is to develop a machine learning technique to accurately predict capreomycin resistance in iMycobacteria/i with low false discovery rates./p/sectionsectiontitleMethods and Results/titlepWe present a stacked ensemble learning model as an alternative to traditional DNA sequence alignment‐based methods using optimal features generated from the physicochemical, evolutionary and secondary structure properties of protein sequences. We train logistic regression, C5.0 and support vector machine (SVM) algorithms as our base classifiers, and our stacked ensemble predictors combine the results from the base classifiers to achieve higher accuracy. Compared with our most accurate base classifier (SVM), our most accurate stacked ensemble predictor increases training accuracy by 2·43. Our stacked ensemble predictors achieve test accuracy up to 81·25./p/sectionsectiontitleConclusions/titlepWe developed a stacked ensemble model to predict capreomycin resistance for iMycobacteria/i with an accuracy 80 using protein sequences with sequence similarity ranging between 10 and 70. This performance cannot be achieved with best‐hit methods due to differences in sequence similarity./p/sectionsectiontitleSignificance and Impact of the Study/titlepToday an estimated one‐half million cases of multidrug‐resistant (MDR) and extensively drug‐resistant (XDR) tuberculosis (TB) occur annually worldwide at a great cost. Because capreomycin is a second‐line drug used to treat drug‐resistant TB, the ability to use a machine learning approach to classify capreomycin‐resistant TB in a timely manner is crucial for the successful treatment of MDR or XDR TB./p/section

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号