首页> 外文会议>International conference on web information systems engineering >Two Approaches to the Dataset Interlinking Recommendation Problem
【24h】

Two Approaches to the Dataset Interlinking Recommendation Problem

机译:数据集互连推荐问题的两种方法

获取原文

摘要

Whenever a dataset t is published on the Web of Data, an exploratory search over existing datasets must be performed to identify those datasets that are potential candidates to be interlinked with t. This paper introduces and compares two approaches to address the dataset interlinking recommendation problem, respectively based on Bayesian classifiers and on Social Network Analysis techniques. Both approaches define rank score functions that explore the vocabularies, classes and properties that the datasets use, in addition to the known dataset links. After extensive experiments using real-world datasets, the results show that the rank score functions achieve a mean average precision of around 60%. Intuitively, this means that the exploratory search for datasets to be interlinked with t might be limited to just the top-ranked datasets, reducing the cost of the dataset interlinking process.
机译:每当将数据集t发布到Web数据网上时,都必须对现有数据集进行探索性搜索,以识别那些可能与t关联的数据集。本文分别介绍和比较了两种基于贝叶斯分类器和社交网络分析技术解决数据集链接推荐问题的方法。两种方法都定义了排名分数函数,除了已知的数据集链接外,还可以探索数据集使用的词汇,类别和属性。在使用现实世界的数据集进行了广泛的实验之后,结果表明,排名得分函数的平均平均精度约为60%。直观上,这意味着探索性搜索将与t链接的数据集可能仅限于排名靠前的数据集,从而降低了数据集链接过程的成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号