首页> 外文会议>International conference on web engineering >Identifying Candidate Datasets for Data Interlinking
【24h】

Identifying Candidate Datasets for Data Interlinking

机译:确定用于数据互连的候选数据集

获取原文

摘要

One of the design principles that can stimulate the growth and increase the usefulness of the Web of data is URIs linkage. However, the related URIs are typically in different datasets managed by different publishers. Hence, the designer of a new dataset must be aware of the existing datasets and inspect their content to define same As links. This paper proposes a technique based on probabilistic classifiers that, given a datasets S to be published and a set T of known published datasets, ranks each T_i ∈ T according to the probability that links between S and T_i can be found by inspecting the most relevant datasets. Results from our technique show that the search space can be reduced up to 85%, thereby greatly decreasing the computational effort.
机译:URI链接是可以刺激数据网增长并提高其实用性的设计原则之一。但是,相关的URI通常位于不同发布者管理的不同数据集中。因此,新数据集的设计者必须了解现有数据集并检查其内容以定义相同的As链接。本文提出了一种基于概率分类器的技术,在给定要发布的数据集S和已知的已发布数据集T的情况下,通过检查最相关的S和T_i之间的链接的可能性,对每个T_i∈T进行排名数据集。我们技术的结果表明,搜索空间可以减少多达85%,从而大大减少了计算量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号