首页> 外文会议>International conference on web-age information management >Identifying Linked Data Datasets for sameAs Interlinking Using Recommendation Techniques
【24h】

Identifying Linked Data Datasets for sameAs Interlinking Using Recommendation Techniques

机译:使用推荐技术为SameA链接识别链接数据数据集

获取原文

摘要

Due to the outstanding role of owl:sameAs as the most widely used linking predicate, the problem of identifying potential Linked Data datasets for sameAs interlinking was studied in this paper. The problem was regarded as a Recommender systems problem, so serveral classical collaborative filtering techniques were employed. The user-item matrix was constructed with rating values defined depending on the number of owl:sameAs RDF links between datasets from Linked Open Data Cloud 2014 dump. The similarity measure is a key for memory-based collaborative filtering methods, a novel dataset semantic similarity measure was proposed based on the vocabulary information extracted from datasets. We conducted experiments to evaluate the accuracy of both the predicted ratings and recommended datasets lists of these recommenders. The experiments demonstrated that our customized recommenders outperformed the original ones with a great deal, and achieved much better metrics in both evaluations.
机译:由于owl:sameAs作为最广泛使用的链接谓词的杰出作用,本文研究了为sameAs链接确定潜在链接数据集的问题。该问题被视为Recommender系统问题,因此采用了服务器经典协作过滤技术。用户项目矩阵的构造使用根据Linked Open Data Cloud 2014转储的数据集之间的owl:sameAs RDF链接的数量定义的评估值来构建。相似度度量是基于内存的协同过滤方法的关键,基于从数据集中提取的词汇信息,提出了一种新的数据集语义相似度度量。我们进行了实验,以评估这些推荐者的预测评分和推荐数据集列表的准确性。实验表明,我们的定制推荐器在很大程度上优于原始推荐器,并且在两次评估中均获得了更好的指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号