首页> 外文期刊>Machine Learning >Constructing generative logical models for optimisation problems using domain knowledge
【24h】

Constructing generative logical models for optimisation problems using domain knowledge

机译:使用域知识构建用于优化问题的生成逻辑模型

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper we seek to identify data instances with a low value of some objective (or cost) function. Normally posed as optimisation problems, our interest is in problems that have the following characteristics: (a) optimal, or even near-optimal solutions are very rare; (b) it is expensive to obtain the value of the objective function for large numbers of data instances; and (c) there is domain knowledge in the form of experience, rules-of-thumb, constraints and the like, which is difficult to translate into the usual constraints for numerical optimisation procedures. Here we investigate the use of Inductive Logic Programming (ILP) to construct models within a procedure that progressively attempts to increase the number of near-optimal solutions. Using ILP in this manner requires a change in focus from discriminatory models (the usual staple for ILP) to generative models. Using controlled datasets, we investigate the use of probability-sampling of solutions based on the estimated cost of clauses found using ILP. Specifically, we compare the results obtained against: (a) simple random sampling; and (b) generative deep network models that use a low-level encoding and automatically construct higher-level features. Our results suggest: (1) Against each of the alternatives, probability-sampling from ILP-constructed models contain more near-optimal solutions; (2) The key to the effectiveness of ILP-constructed models is the availability of domain knowledge. We also demonstrate the use of ILP in this manner on two real-world problems from the area of drug-design (predicting solubility and binding affinity), using domain knowledge of chemical ring structures and functional groups. Taken together, our results suggest that generative modelling using ILP can be very effective for optimisation problems where: (a) the number of training instances to be used is restricted, and (b) there is domain knowledge relevant to low-cost solutions.
机译:在本文中,我们寻求以低价值的某些目标(或成本)函数的数据实例。通常构成为优化问题,我们的兴趣是存在以下特点的问题:(a)最佳,甚至近最佳解决方案非常罕见; (b)获得大量数据实例的目标函数的值是昂贵的; (c)以经验,拇指规则,约束等的形式存在域知识,这很难转化为数值优化程序的平常约束。在这里,我们调查使用感应逻辑编程(ILP)在逐步尝试增加近最优解决方案的数量的过程中构建模型。以这种方式使用ILP需要将焦点的变化从歧视模型(ILP通常的钉)到生成模型。使用受控数据集,我们研究了基于使用ILP发现的条款的估计成本的解决方案概率抽样。具体而言,我们比较反对的结果:(a)简单的随机抽样; (b)使用低级编码的生成深网络模型,并自动构建更高级别的功能。我们的结果表明:(1)针对每种替代方案,ILP构造模型的概率抽样包含更多近最佳解决方案; (2)ILP构建模型有效性的关键是域知识的可用性。我们还通过使用域的化学环结构和官能团的结构域知识来证明在这种方式上以这种方式展示了以这种方式使用来自这种实际问题(预测溶解性和结合亲和力)的真实问题。我们的结果表明,使用ILP的生成建模对于优化问题非常有效:(a)所使用的培训实例的数量受到限制,并且(b)有与低成本解决方案相关的域知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号