首页> 外文OA文献 >Development of Predictive Models for Identifying Potential S100A9 Inhibitors Based on Machine Learning Methods
【2h】

Development of Predictive Models for Identifying Potential S100A9 Inhibitors Based on Machine Learning Methods

机译:基于机器学习方法的识别电位S100A9抑制器的预测模型的开发

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

S100A9 is a potential therapeutic target for various disease including prostate cancer, colorectal cancer, and Alzheimer's disease. However, the sparsity of atomic level data, such as protein-protein interaction of S100A9 with RAGE, TLR4/MD2, or CD147 (EMMPRIN) hinders the rational drug design of S100A9 inhibitors. Herein we first report predictive models of S100A9 inhibitory effect by applying machine learning classifiers on 2D-molecular descriptors. The models were optimized through feature selectors as well as classifiers to produce the top eight random forest models with robust predictability and high cost-effectiveness. Notably, optimal feature sets were obtained after the reduction of 2,798 features into dozens of features with the chopping of fingerprint bits. Moreover, the high efficiency of compact feature sets allowed us to further screen a large-scale dataset (over 6,000,000 compounds) within a week. Through a consensus vote of the top models, 46 hits (hit rate = 0.000713%) were identified as potential S100A9 inhibitors. We expect that our models will facilitate the drug discovery process by providing high predictive power as well as cost-reduction ability and give insights into designing novel drugs targeting S100A9.
机译:S100A9是各种疾病的潜在治疗靶标,包括前列腺癌,结直肠癌和阿尔茨海默病。然而,原子水平数据的稀疏性,例如S100A9的蛋白质 - 蛋白质与RAGE,TLR4 / MD2或CD147(EMMPRIN)的相互作用阻碍了S100A9抑制剂的合理药物设计。这里,我们首先通过在2D分子描述符上施加机器学习分类器来报告S100A9抑制作用的预测模型。该模型通过特征选择器以及分类器进行优化,以生产具有稳健的可预测性和高成本效益的前八个随机林模型。值得注意的是,在减少2,798个特征之后获得最佳特征集,其中具有斩波位的数十个特征。此外,紧凑型特征集的高效率允许我们在一周内进一步屏蔽大规模数据集(超过6,000,000种化合物)。通过顶级模型的共识投票,46次命中(命中率= 0.000713%)被鉴定为潜在的S100A9抑制剂。我们预计我们的模型将通过提供高预测力以及成本降低的能力来促进药物发现过程,并在设计靶向S100A9的新药的洞察中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号