【24h】

S-RDF: A New RDF Serialization Format for Better Storage Without Losing Human Readability

机译:S-RDF:一种新的RDF序列化格式,可在不损失人类可读性的情况下实现更好的存储

获取原文

摘要

Nowadays, RDF data becomes more and more popular on the Web due to the advances of the Semantic Web and the Linked Open Data initiatives. Several works are focused on transforming relational databases to RDF by storing related data in N-Triple serialization format. However, these approaches do not take into account the existing normalization of their databases since N-Triple format allows data redundancy and does not control any normalization by itself. Moreover, the mostly used and recommended serialization formats, such as RDF/XML, Turtle, and HDT, have either high human-readability but waste storage capacity, or focus further on storage capacities while providing low human-readability. To overcome these limitations, we propose here a new serialization format, called S-RDF. By considering the structure (graph) and values of the RDF data separately, S-RDF reduces the duplicity of values by using unique identifiers. Results show an important improvement over the existing serialization formats in terms of storage (up to 71,66% w.r.t. N-Triples) and human readability.
机译:如今,由于语义Web和链接开放数据计划的发展,RDF数据在Web上变得越来越流行。通过以N-Triple序列化格式存储相关数据,一些工作致力于将关系数据库转换为RDF。但是,这些方法未考虑其数据库的现有规范化,因为N-Triple格式允许数据冗余,并且本身无法控制任何规范化。此外,最常用和推荐的序列化格式(例如RDF / XML,Turtle和HDT)具有较高的可读性,但浪费了存储容量,或者在提供较低的可读性的同时进一步关注存储容量。为了克服这些限制,我们在这里提出一种新的序列化格式,称为S-RDF。通过分别考虑RDF数据的结构(图形)和值,S-RDF通过使用唯一标识符来减少值的重复性。结果表明,与现有的序列化格式相比,在存储方面(w.r.t. N-Triples高达71.66%)和人类可读性有了重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号