首页> 外文会议>International Conference on Data Engineering >Simplifying Entity Resolution on Web Data with Schema-agnostic, Non-iterative Matching
【24h】

Simplifying Entity Resolution on Web Data with Schema-agnostic, Non-iterative Matching

机译:使用架构 - 不可知的Web数据简化实体分辨率,非迭代匹配

获取原文

摘要

Entity Resolution (ER) aims to identify different descriptions in various Knowledge Bases (KBs) that refer to the same entity. ER is challenged by the Variety, Volume and Veracity of descriptions published in the Web of Data. To address them, we propose the MinoanER framework that fulfills full automation and support of highly heterogeneous entities. MinoanER leverages a token-based similarity of entities to define a new metric that derives the similarity of neighboring entities from the most important relations, indicated only by statistics. For high efficiency, similarities are computed from a set of schema-agnostic blocks and processed in a non-iterative way that involves four threshold-free heuristics. We demonstrate that the effectiveness of MinoanER is comparable to existing ER tools over real KBs exhibiting low heterogeneity in terms of entity types and content. Yet, MinoanER outperforms state-of-the-art ER tools when matching highly heterogeneous KBs.
机译:实体分辨率(ER)旨在识别引用同一实体的各种知识库(KBS)中的不同描述。呃受到数据网络中发布的描述的品种,体积和真实性的挑战。要解决它们,我们提出了符合高度异构实体的完全自动化和支持的MinoAler框架。 MINOANER利用实体的基于令牌的相似性来定义一个新的度量标准,从最重要的关系中源于来自最重要的关系,仅通过统计表示。为了高效率,从一组模式 - 不可知块计算相似性,并以非迭代方式处理,涉及四个阈值的启发式。我们表明,MINOANER的有效性与现有的ER工具与实体类型和内容方面的实际KBS上具有低异质性。然而,在匹配高度异质的KBS时,Minoaner优于最先进的ER工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号