首页> 外文会议>ESWC 2014;Extended Semantic Web Conference >Unsupervised Link Discovery through Knowledge Base Repair
【24h】

Unsupervised Link Discovery through Knowledge Base Repair

机译:通过知识库修复无监督链接发现

获取原文

摘要

The Linked Data Web has developed into a compendium of partly very large datasets. Devising efficient approaches to compute links between these datasets is thus central to achieve the vision behind the Data Web. Unsupervised approaches to achieve this goal have emerged over the last few years. Yet, so far, none of these unsupervised approaches makes use of the replication of resources across several knowledge bases to improve the accuracy it achieves while linking. In this paper, we present COLIBRI, an iterative unsupervised approach for link discovery. COLIBRI allows the discovery of links between n datasets (n ≥ 2) while improving the quality of the instance data in these datasets. To this end, COLIBRI combines error detection and correction with unsupervised link discovery. We evaluate our approach on five benchmark datasets with respect to the F-score it achieves. Our results suggest that COLIBRI can significantly improve the results of unsupervised machine-learning approaches for link discovery while correctly detecting erroneous resources.
机译:链接数据Web已开发为部分非常大的数据集的汇编。因此,设计了在这些数据集之间计算链路的有效方法,因此是实现数据Web背后的视觉。在过去几年中出现了无监督的实现这一目标的方法。然而,到目前为止,这些无监督的方法都不是利用各种知识库的资源复制,以提高它在连接时实现的准确性。在本文中,我们展示了Colibri,这是一种迭代无人监督的链接发现方法。 Colibri允许发现N个数据集(n≥2)之间的链接,同时提高这些数据集中的实例数据的质量。为此,Colibri将错误检测和校正与无监督的链路发现相结合。我们在五个基准数据集中评估了它的方法,了解它实现的F分数。我们的研究结果表明,在正确检测错误资源的同时,Colibri可以显着提高无监督机器学习方法的结果,以便在正确检测错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号