首页> 外文会议>International Conference on Computational Intelligence >A New Computational Approach to Identify Essential Genes in Bacterial Organisms Using Machine Learning
【24h】

A New Computational Approach to Identify Essential Genes in Bacterial Organisms Using Machine Learning

机译:一种新的计算方法,可以使用机器学习鉴定细菌生物中必需基因

获取原文

摘要

Essential genes of an organism are those genes that are required for the growth to a fertile adult and is pivotal for the survival of an organism. In this study, a new computational approach based on machine learning method is designed which can constructively project essential genes by integration of homologous, gene intrinsic, and network topology features. A set of 15 bacterial organisms as reference species have been used which have characterized essential genes. By applying "Extreme Gradient Boosting (XGBoost)" for Bacillus Subtilis 168, the classification model through tenfold cross-validation test gave average AUC value of 0.9649. Further applying this new model to a closely related organism Salmonella enterica serovar Typhimurium LT2 resulted in a very definitive AUC value of 0.8608. To assess the stability and consistency of the proposed classifier, a different set of target organisms comprised of Escherichia coli MG1655 and Streptococcus sanguinis SK36 and another classifier based on PCR method were implemented. The performance of the model based on principal component regression (PCR) method for both set of target organisms resulted in lower AUC values. It shows that the newly designed feature-integrated approach based on XGBoost method results in better predictive accuracy to identify essential genes.
机译:生物体的必需基因是那些对生长成年人生长所需的基因,并且是生物体的存活率的关键。在本研究中,设计了一种基于机器学习方法的新计算方法,其设计通过集成同源,基因内在和网络拓扑特征来建设性地项目基因。已经使用了一组15种细菌生物作为参考物种,其具有表征基因。通过对枯草芽孢杆菌168施加“极端梯度升压(XGBoost)”,通过十倍交叉验证测试的分类模型得到了0.9649的平均AUC值。进一步将该新模型进一步应用于密切相关的有机体沙门氏菌肠道血吸虫LT2导致非常明确的AUC值为0.8608。为了评估所提出的分类剂的稳定性和一致性,实施了由大肠杆菌Mg1655和链球菌SK36和基于PCR方法的群体组成的不同一组靶生物。基于主成分回归(PCR)方法对两组靶生物的模型的性能导致较低的AUC值。它表明,基于XGBoost方法的新设计的特征综合方法导致更好的预测准确性来识别必要基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号