Conference: International Conference on Computational Intelligence in Data Science

Candidate Generation for Instance Matching on Semantic Web


Abstract

The growth of the Semantic Web has led to a proliferation of data sources, making it essential to recognize real-world entities and to identify multiple references to the same entity in order to facilitate data sharing and integration. Because data on the Semantic Web is heterogeneous, entities from different sources are compared by assessing the similarity of the features they have in common in order to identify matches. As data sets grow, candidate generation methods are generally employed to avoid the quadratic time complexity that would otherwise be incurred if the pairwise similarity of all entities were computed. Here we propose a novel index-based approach for candidate generation and reduction. The evaluation shows that the proposed method scales well and improves recall significantly.
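
The abstract does not describe the proposed index, so the following is only a minimal sketch of the general idea of index-based candidate generation, not the authors' method. It assumes each entity is reduced to a single text label, uses a plain token inverted index, and treats a hypothetical `min_shared` threshold as a simple stand-in for candidate reduction. All function names and the sample data are illustrative.

```python
from collections import defaultdict


def tokens(label):
    """Lowercase a label and split it into a set of whitespace-separated tokens."""
    return set(label.lower().split())


def build_index(entities):
    """Map each token to the set of entity ids whose label contains it."""
    index = defaultdict(set)
    for entity_id, label in entities.items():
        for token in tokens(label):
            index[token].add(entity_id)
    return index


def candidate_pairs(source, target, min_shared=1):
    """Return (source_id, target_id) pairs sharing at least min_shared tokens.

    Only entities that co-occur in some index entry are paired, so the
    full source x target cross product is never enumerated.
    """
    index = build_index(target)
    pairs = set()
    for source_id, label in source.items():
        shared = defaultdict(int)  # target_id -> number of shared tokens
        for token in tokens(label):
            for target_id in index.get(token, ()):
                shared[target_id] += 1
        # Assumed reduction step: drop weakly overlapping candidates.
        pairs.update((source_id, t) for t, n in shared.items() if n >= min_shared)
    return pairs


# Illustrative data only; not taken from the paper.
source = {"a1": "Tim Berners-Lee", "a2": "Semantic Web Journal"}
target = {"b1": "Berners-Lee Tim", "b2": "Journal of Web Semantics", "b3": "Alan Turing"}

pairs = candidate_pairs(source, target)
```

With this toy input, only two of the six possible source-target pairs survive as candidates for the more expensive similarity comparison. Raising `min_shared` shrinks the candidate set further at the risk of losing true matches, which is the usual trade-off between reduction and recall.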
