首页> 外文会议>The semantic web - ISWC 2009 >Graph-Based Ontology Construction from Heterogenous Evidences
【24h】

Graph-Based Ontology Construction from Heterogenous Evidences

机译:基于异构证据的基于图的本体构建

获取原文
获取原文并翻译 | 示例

摘要

Ontologies are tools for describing and structuring knowledge, with many applications in searching and analyzing complex knowledge bases. Since building them manually is a costly process, there are various approaches for bootstrapping ontologies automatically through the analysis of appropriate documents. Such an analysis needs to find the concepts and the relationships that should form the ontology. However, since relationship extraction methods are imprecise and cannot homogeneously cover all concepts, the initial set of relationships is usually inconsistent and rather unbalanced - a problem which, to the best of our knowledge, was mostly ignored so far. In this paper, we define the problem of extracting a consistent as well as properly structured ontology from a set of inconsistent and heterogeneous relationships. Moreover, we propose and compare three graph-based methods for solving the ontology extraction problem. We extract relationships from a large-scale data set of more than 325K documents and evaluate our methods against a gold standard ontology comprising more than 12K relationships. Our study shows that an algorithm based on a modified formulation of the dominating set problem outperforms greedy methods.
机译:本体是用于描述和构造知识的工具,在搜索和分析复杂知识库中有许多应用。由于手动构建它们是一个昂贵的过程,因此有多种方法可以通过分析适当的文档来自动引导本体。这样的分析需要找到应该构成本体的概念和关系。但是,由于关系提取方法不精确并且不能均匀地涵盖所有概念,因此最初的关系集通常是不一致的,甚至是不平衡的,就我们所知,到目前为止,这个问题大部分都被忽略了。在本文中,我们定义了从一组不一致且异构的关系中提取一致且结构正确的本体的问题。此外,我们提出并比较了三种基于图的解决本体提取问题的方法。我们从超过325K文档的大规模数据集中提取关系,并针对包含超过12K关系的金标准本体评估我们的方法。我们的研究表明,基于控制集问题的改进公式的算法优于贪婪方法。

著录项

  • 来源
    《The semantic web - ISWC 2009》|2009年|P.81-96|共16页
  • 会议地点 Chantilly VA(US);Chantilly VA(US)
  • 作者单位

    Hasso-Plattner-Institut, Prof.- Dr.-Helmert-Str. 2-3, 14482 Potsdam, Germany;

    rnHumboldt-Universitaet zu Berlin, Unter den Linden 6, 10099 Berlin, Germany;

    rnHumboldt-Universitaet zu Berlin, Unter den Linden 6, 10099 Berlin, Germany;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算机网络;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号