Conference: ESWC 2014 (Extended Semantic Web Conference)

Generating Synthetic RDF Data with Connected Blank Nodes for Benchmarking


Abstract

Generators for synthetic RDF datasets are very important for testing and benchmarking various semantic data management tasks (e.g. querying, storage, update, comparison, integration). However, current generators do not sufficiently support (or totally ignore) blank node connectivity. Blank nodes are used for various purposes (e.g. for describing complex attributes), and a significant percentage of resources is currently represented with blank nodes. Moreover, several semantic data management tasks, like isomorphism checking (useful for checking equivalence) and blank node matching (useful in comparison, versioning, synchronization, and in semantic similarity functions), not only have to deal with blank nodes, but their complexity and optimality depend on the connectivity of blank nodes. To enable the comparative evaluation of the various techniques for carrying out these tasks, in this paper we present the design and implementation of a generator, called BGen, which allows building datasets containing blank nodes with the desired complexity, controllable through various features (morphology, size, diameter, density and clustering coefficient). Finally, the paper reports experimental results concerning the efficiency of the generator, as well as results from using the generated datasets, which demonstrate the value of the generator.
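To make the idea of controllable blank node connectivity concrete, the following is a minimal Python sketch. It is not BGen and does not reproduce the paper's algorithm; the function names, the `http://example.org/linkedTo` predicate, and the choice of directed-graph density |E| / (|V|(|V| - 1)) are all assumptions made for illustration. It builds one weakly connected component of blank nodes (a random spanning tree plus a chosen number of extra edges) and serializes it as N-Triples.

```python
import random


def generate_bnode_component(n_nodes, n_extra_edges, seed=0):
    """Return a set of directed edges (i, j) over blank nodes 0..n_nodes-1.

    Hypothetical helper, not part of BGen. A random spanning tree keeps the
    component weakly connected; extra edges then raise its density.
    """
    rng = random.Random(seed)
    edges = set()
    # Attach each new node to a randomly chosen earlier node.
    for j in range(1, n_nodes):
        i = rng.randrange(j)
        edges.add((i, j))
    # Add extra edges drawn from the ordered pairs not used yet.
    candidates = [
        (i, j)
        for i in range(n_nodes)
        for j in range(n_nodes)
        if i != j and (i, j) not in edges
    ]
    rng.shuffle(candidates)
    edges.update(candidates[:n_extra_edges])
    return edges


def density(n_nodes, edges):
    """Directed-graph density |E| / (|V| * (|V| - 1)); an assumed definition."""
    if n_nodes < 2:
        return 0.0
    return len(edges) / (n_nodes * (n_nodes - 1))


def to_ntriples(edges, predicate="http://example.org/linkedTo"):
    """Serialize edges as N-Triples lines whose subject and object are blank nodes."""
    return [f"_:b{i} <{predicate}> _:b{j} ." for i, j in sorted(edges)]


if __name__ == "__main__":
    component = generate_bnode_component(n_nodes=5, n_extra_edges=3, seed=1)
    print("density:", density(5, component))
    print("\n".join(to_ntriples(component)))
```

Varying `n_nodes` and `n_extra_edges` gives rough control over size and density; features such as diameter or clustering coefficient would need additional constraints on how edges are placed, which this sketch does not attempt.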