...
首页> 外文期刊>Semantic web >Quality metrics for RDF graph summarization
【24h】

Quality metrics for RDF graph summarization

机译:RDF图摘要的质量指标

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

RDF Graph Summarization pertains to the process of extracting concise but meaningful summaries from RDF Knowledge Bases (KBs) representing as close as possible the actual contents of the KB both in terms of structure and data. RDF Summarization allows for better exploration and visualization of the underlying RDF graphs, optimization of queries or query evaluation in multiple steps, better understanding of connections in Linked Datasets and many other applications. In the literature, there are efforts reported presenting algorithms for extracting summaries from RDF KBs. These efforts though provide different results while applied on the same KB, thus a way to compare the produced summaries and decide on their quality and best-fitness for specific tasks, in the form of a quality framework, is necessary. So in this work, we propose a comprehensive Quality Framework for RDF Graph Summarization that would allow a better, deeper and more complete understanding of the quality of the different summaries and facilitate their comparison. We work at two levels: the level of the ideal summary of the KB that could be provided by an expert user and the level of the instances contained by the KB. For the first level, we are computing how close the proposed summary is to the ideal solution (when this is available) by defining and computing its precision, recall and F-measure against the ideal solution. For the second level, we are computing if the existing instances are covered (i.e. can be retrieved) and at which degree by the proposed summary. Again we define and compute its precision, recall and F-measure against the data contained in the original KB. We also compute the connectivity of the proposed summary compared to the ideal one, since in many cases (like, e.g., when we want to query) this is an important factor and in general in RDF, linked datasets are usually used. We use our quality framework to test the results of three of the best RDF Graph Summarization algorithms, when summarizing different (in terms of content) and diverse (in terms of total size and number of instances, classes and predicates) KBs and we present comparative results for them. We conclude this work by discussing these results and the suitability of the proposed quality framework in order to get useful insights for the quality of the presented results.
机译:RDF图摘要涉及从RDF知识库(KBS)提取简洁但有意义的摘要的过程尽可能接近结构和数据的实际内容。 RDF摘要允许更好地探索和可视化底层的RDF图形,在多个步骤中优化查询或查询评估,更好地了解链接数据集中的连接和许多其他应用程序。在文献中,有努力报告​​用于从RDF KBS提取摘要的呈现算法。这些努力虽然提供了不同的结果,同时应用于同一KB,因此需要一种方法来比较所产生的摘要,并以质量框架的形式对所产生的摘要进行决定其质量和最佳健康状况。因此,在这项工作中,我们向RDF图表摘要提出了一个全面的质量框架,这将使对不同摘要的质量更好,更深入地了解并促进他们的比较。我们在两个级别工作:专家用户可以提供的KB的理想摘要级别以及KB所含实例的级别。对于第一级,我们正在计算所提出的摘要是如何通过定义和计算其精确度,召回和用于理想解决方案的精度,召回和F测量来关闭理想的解决方案(当可用时)。对于第二级,我们计算现有实例是否覆盖(即可以检索),并在该摘要中以哪个程度。我们再次定义并计算其精确度,召回和F测量对原始KB中包含的数据。与理想的,我们还计算所提出的摘要的连接,因为在许多情况下(例如,当我们想要查询时),这是一个重要因素,并且通常在RDF中,通常使用链接数据集。我们使用质量框架来测试三个最佳RDF图摘要算法的结果,总结了不同(在内容方面)和多样化(在总体规模和课程,类别和谓词方面)KBS和我们呈现比较结果。我们通过讨论这些结果以及所提出的质量框架的适用性来结束这项工作,以便对所提出的结果质量有用的见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号