首页> 美国卫生研究院文献>Molecules >Extremely Randomized Machine Learning Methods for Compound Activity Prediction
【2h】

Extremely Randomized Machine Learning Methods for Compound Activity Prediction

机译:复合活动预测的极端随机机器学习方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Speed, a relatively low requirement for computational resources and high effectiveness of the evaluation of the bioactivity of compounds have caused a rapid growth of interest in the application of machine learning methods to virtual screening tasks. However, due to the growth of the amount of data also in cheminformatics and related fields, the aim of research has shifted not only towards the development of algorithms of high predictive power but also towards the simplification of previously existing methods to obtain results more quickly. In the study, we tested two approaches belonging to the group of so-called ‘extremely randomized methods’—Extreme Entropy Machine and Extremely Randomized Trees—for their ability to properly identify compounds that have activity towards particular protein targets. These methods were compared with their ‘non-extreme’ competitors, i.e., Support Vector Machine and Random Forest. The extreme approaches were not only found out to improve the efficiency of the classification of bioactive compounds, but they were also proved to be less computationally complex, requiring fewer steps to perform an optimization procedure.
机译:速度,对计算资源的相对较低的要求以及化合物生物活性评估的高效率已引起人们对将机器学习方法应用于虚拟筛选任务的兴趣迅速增长。然而,由于化学信息学和相关领域中数据量的增长,研究的目标不仅转向了具有高预测能力的算法的开发,而且也转向了简化现有方法以更快地获得结果的方法。在这项研究中,我们测试了属于“极端随机方法”类别的两种方法-极端熵机器和极端随机树-能够正确识别对特定蛋白质靶具有活性的化合物的能力。将这些方法与他们的“非极端”竞争对手(即支持向量机和随机森林)进行了比较。不仅找到了提高生物活性化合物分类效率的极端方法,而且还证明了它们的计算复杂度较低,需要较少的步骤来执行优化程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号