首页> 外文会议>International semantic web conference >SAKey: Scalable Almost Key Discovery in RDF Data
【24h】

SAKey: Scalable Almost Key Discovery in RDF Data

机译:SAKey:RDF数据中的可伸缩的近乎密钥发现

获取原文

摘要

Exploiting identity links among RDF resources allows applications to efficiently integrate data. Keys can be very useful to discover these identity links. A set of properties is considered as a key when its values uniquely identify resources. However, these keys are usually not available. The approaches that attempt to automatically discover keys can easily be overwhelmed by the size of the data and require clean data. We present SAKey, an approach that discovers keys in RDF data in an efficient way. To prune the search space, SAKey exploits characteristics of the data that are dynamically detected during the process. Furthermore, our approach can discover keys in datasets where erroneous data or duplicates exist (i.e., almost keys). The approach has been evaluated on different synthetic and real datasets. The results show both the relevance of almost keys and the efficiency of discovering them.
机译:利用RDF资源之间的身份链接可以使应用程序有效地集成数据。密钥对于发现这些身份链接非常有用。当一组属性的值唯一地标识资源时,它被视为键。但是,这些键通常不可用。试图自动发现密钥的方法很容易被数据的大小所淹没,并需要干净的数据。我们介绍SAKey,这是一种以有效方式发现RDF数据中密钥的方法。为了修剪搜索空间,SAKey利用了在此过程中动态检测到的数据的特征。此外,我们的方法可以在存在错误数据或重复项(即几乎是密钥)的数据集中发现密钥。该方法已在不同的综合和真实数据集上进行了评估。结果显示了几乎关键的相关性和发现它们的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号