首页> 外文期刊>Future generation computer systems >Class expression induction as concept space exploration: From DL-FOIL to DL-FOCL
【24h】

Class expression induction as concept space exploration: From DL-FOIL to DL-FOCL

机译:类表达归纳作为概念空间探索:从DL-FOIL到DL-FOCL

获取原文
获取原文并翻译 | 示例
       

摘要

The Web of Data is one of the perspectives of the Semantic Web. In this context, concept learning services, supported by multirelational machine learning, have been integrated in various tools for knowledge engineers to carry out several tasks related to the construction, completion and maintenance of the knowledge bases: essentially they are used to elicit new candidate concept definitions (i.e. axioms regarding classes) to be incorporated in the knowledge bases possibly also as replacements for previous ones. Sundry reference approaches rely on a covering strategy to generalize input examples that can be regarded as a form of hill-climbing search that explores a huge discrete conceptual space. Methods adopting this strategy are known to be affected by an inherent problem of myopia. In particular, our DL-FOIL has been shown to suffer from this problem as its algorithm is based on a stochastic yet informed exploration of the concept space, by means of a refinement operator, to generate partial descriptions iteratively. To tackle this problem and enhance the performance of our system we have introduced a series of extensions of the original DL-FOIL algorithm, that have led to various releases of its spin-off DL-FOCL. Essentially they aim at reducing the aforementioned problem through specific strategies grounded on either the integration of meta-heuristics, such as repeated hill-climbing and tabu search, or the employment of some form of lookahead. In this work, we present consolidated and extended releases of both DL-FOIL and DL-FOCL along various dimensions: better heuristics and stop conditions, more complex refinement operators with the possibility to perform the specialization adopting iterative deepening or lookahead strategies, improved versions of the algorithm based on the repeated hill-climbing strategy with new quality criteria and of the tabu search with a different policy for managing the local memory. All the implementations of these approaches have been extensively evaluated in three experimental sessions, involving various publicly available knowledge bases and fragments extracted from the Linked Data cloud, showing interesting results and indicating some lessons to be learned: our approaches outperformed a popular reference system from the DL-Learner framework on learning problems when the open-world semantics is explicitly considered. They also exhibited an analogous performance on a benchmark of datasets from contexts with an intended underlying closed-world semantics.
机译:数据网是语义网的一种视角。在这种情况下,由多关系机器学习支持的概念学习服务已集成到各种工具中,以便知识工程师执行与知识库的构建,完成和维护有关的若干任务:本质上,它们用于引发新的候选概念。定义(即有关类的公理)可能会被并入知识库中,以替代先前的定义。杂项参考方法依靠覆盖策略来概括输入示例,这些示例可以看作是探索巨大离散概念空间的爬山搜索形式。已知采用这种策略的方法会受到近视眼固有问题的影响。特别是,我们的DL-FOIL已显示出遭受此问题的困扰,因为其算法基于对概念空间的随机且知情的探索,并借助细化运算符来迭代生成部分描述。为了解决此问题并增强我们系统的性能,我们引入了原始DL-FOIL算法的一系列扩展,从而导致了其衍生产品DL-FOCL的各种版本。本质上,它们旨在通过基于元启发式方法(例如反复爬山和禁忌搜索)或采用某种形式的先行策略的特定策略来减少上述问题。在这项工作中,我们将介绍DL-FOIL和DL-FOCL的合并和扩展版本,涉及各个方面:更好的试探法和停止条件,更复杂的提炼运算符,并有可能采用迭代加深或超前策略来执行专业化,该算法基于具有新质量标准的重复爬山策略和具有不同策略的禁忌搜索来管理本地内存。这些方法的所有实现已在三个实验会议中进行了广泛评估,涉及从链接数据云中提取的各种公开可用的知识库和片段,显示出有趣的结果并指出了一些要吸取的经验教训:我们的方法优于来自明确考虑开放世界语义时的学习问题的DL-Learner框架。他们还在具有预期的底层封闭世界语义的上下文数据集基准上表现出类似的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号