...
首页> 外文期刊>Semantic web >Data-driven assessment of structural evolution of RDF graphs
【24h】

Data-driven assessment of structural evolution of RDF graphs

机译:RDF图结构演化的数据驱动评估

获取原文
获取原文并翻译 | 示例
           

摘要

Since the birth of the Semantic Web, numerous knowledge bases have appeared. The applications that exploit them rely on the quality of their data through time. In this regard, one of the main dimensions of data quality is conformance to the expected usage of the vocabulary. However, the vocabulary usage (i.e., how classes and properties are actually populated) can vary from one base to another. Moreover, through time, such usage can evolve within a base and diverge from the previous practices. Methods have been proposed to follow the evolution of a knowledge base by the observation of the changes of their intentional schema (or ontology); however, they do not capture the evolution of their actual data, which can vary greatly in practice. In this paper, we propose a data-driven approach to assess the global evolution of vocabulary usage in large RDF graphs. Our proposal relies on two structural measures defined at different granularities (dataset vs update), which are based on pattern mining techniques. We have performed a thorough experimentation which shows that our approach is scalable, and can capture structural evolution through time of both synthetic (LUBM) and real knowledge bases (different snapshots and updates of DBpedia).
机译:自从语义网络的诞生以来,似乎有许多知识库。利用它们通过时间依赖其数据质量的应用程序。在这方面,数据质量的主要方面之一是符合词汇的预期使用。但是,词汇用法(即,实际填充的类和属性如何)可以从一个基础都不同于另一个基础。此外,通过时间,这种用法可以在基础上发展并从先前的实践中发散。已经提出了通过观察其故意模式(或本体论)的变化来遵循知识库的演变;但是,它们不会捕捉到他们实际数据的演变,这可以在实践中大大变化。在本文中,我们提出了一种数据驱动的方法来评估大RDF图中的词汇使用的全球演变。我们的提案依赖于在不同粒度(DataSet VS更新)中定义的两个结构措施,这是基于模式挖掘技术。我们已经进行了彻底的实验,表明我们的方法是可扩展的,并且可以通过合成(LUBM)和真实知识库(DBPedia的不同快照和更新)来捕获结构演变。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号