Quality metrics for RDF graph summarization

Zneika Mussab; Vodislav Dan; Kotzinos Dimitris

首页> 外文期刊>Semantic web >Quality metrics for RDF graph summarization

【24h】

Quality metrics for RDF graph summarization

机译：RDF图摘要的质量指标

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

RDF Graph Summarization pertains to the process of extracting concise but meaningful summaries from RDF Knowledge Bases (KBs) representing as close as possible the actual contents of the KB both in terms of structure and data. RDF Summarization allows for better exploration and visualization of the underlying RDF graphs, optimization of queries or query evaluation in multiple steps, better understanding of connections in Linked Datasets and many other applications. In the literature, there are efforts reported presenting algorithms for extracting summaries from RDF KBs. These efforts though provide different results while applied on the same KB, thus a way to compare the produced summaries and decide on their quality and best-fitness for specific tasks, in the form of a quality framework, is necessary. So in this work, we propose a comprehensive Quality Framework for RDF Graph Summarization that would allow a better, deeper and more complete understanding of the quality of the different summaries and facilitate their comparison. We work at two levels: the level of the ideal summary of the KB that could be provided by an expert user and the level of the instances contained by the KB. For the first level, we are computing how close the proposed summary is to the ideal solution (when this is available) by defining and computing its precision, recall and F-measure against the ideal solution. For the second level, we are computing if the existing instances are covered (i.e. can be retrieved) and at which degree by the proposed summary. Again we define and compute its precision, recall and F-measure against the data contained in the original KB. We also compute the connectivity of the proposed summary compared to the ideal one, since in many cases (like, e.g., when we want to query) this is an important factor and in general in RDF, linked datasets are usually used. We use our quality framework to test the results of three of the best RDF Graph Summarization algorithms, when summarizing different (in terms of content) and diverse (in terms of total size and number of instances, classes and predicates) KBs and we present comparative results for them. We conclude this work by discussing these results and the suitability of the proposed quality framework in order to get useful insights for the quality of the presented results.

机译：RDF图摘要涉及从RDF知识库（KBS）提取简洁但有意义的摘要的过程尽可能接近结构和数据的实际内容。 RDF摘要允许更好地探索和可视化底层的RDF图形，在多个步骤中优化查询或查询评估，更好地了解链接数据集中的连接和许多其他应用程序。在文献中，有努力报告用于从RDF KBS提取摘要的呈现算法。这些努力虽然提供了不同的结果，同时应用于同一KB，因此需要一种方法来比较所产生的摘要，并以质量框架的形式对所产生的摘要进行决定其质量和最佳健康状况。因此，在这项工作中，我们向RDF图表摘要提出了一个全面的质量框架，这将使对不同摘要的质量更好，更深入地了解并促进他们的比较。我们在两个级别工作：专家用户可以提供的KB的理想摘要级别以及KB所含实例的级别。对于第一级，我们正在计算所提出的摘要是如何通过定义和计算其精确度，召回和用于理想解决方案的精度，召回和F测量来关闭理想的解决方案（当可用时）。对于第二级，我们计算现有实例是否覆盖（即可以检索），并在该摘要中以哪个程度。我们再次定义并计算其精确度，召回和F测量对原始KB中包含的数据。与理想的，我们还计算所提出的摘要的连接，因为在许多情况下（例如，当我们想要查询时），这是一个重要因素，并且通常在RDF中，通常使用链接数据集。我们使用质量框架来测试三个最佳RDF图摘要算法的结果，总结了不同（在内容方面）和多样化（在总体规模和课程，类别和谓词方面）KBS和我们呈现比较结果。我们通过讨论这些结果以及所提出的质量框架的适用性来结束这项工作，以便对所提出的结果质量有用的见解。

著录项

来源
《Semantic web》 |2019年第3期|共30页
作者
Zneika Mussab; Vodislav Dan; Kotzinos Dimitris;
展开▼
作者单位

Paris Seine Univ ETIS UMR 8051 Paris France;

Paris Seine Univ ETIS UMR 8051 Paris France;

Paris Seine Univ ETIS UMR 8051 Paris France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Quality framework; quality metrics; RDF Summarization; linked Open Data; RDF query processing;

机译：质量框架;质量指标;RDF摘要;链接开放数据;RDF查询处理;

相似文献

外文文献
中文文献
专利

1. Quality metrics for RDF graph summarization [J] . Zneika Mussab, Vodislav Dan, Kotzinos Dimitris Semantic web . 2019,第3期

机译：RDF图摘要的质量指标
2. RDF graph summarization for first-sight structure discovery [J] . Goasdoue Francois, Guzewicz Pawel, Manolescu Ioana The VLDB journal . 2020,第5期

机译：一视图结构发现的RDF图摘要
3. Content-based Union and Complement Metrics for Dataset Search over RDF Knowledge Graphs [J] . ACM journal of data and information quality . 2020,第2期

机译：基于内容的联合和补充指标，用于数据集搜索RDF知识图表
4. Query-Oriented Summarization of RDF Graphs [C] . Sejla Cebiric, Francois Goasdoue, Ioana Manolescu International conference on very large data bases . 2015

机译：RDF图的面向查询的摘要
5. A study of graph partitioning techniques for fast indexing and query processing of a large RDF graph [D] . Barenkala, Dinesh 2013

机译：用于大型RDF图的快速索引和查询处理的图分区技术研究
6. Can Patient Record Summarization Support Quality Metric Abstraction? [O] . Rimma Pivovarov, Yael Judith Coppleson, Sharon Lipsky Gorman, 2016

机译：患者记录摘要可以支持质量指标抽象吗？
7. Query-Oriented Summarization of RDF Graphs [O] . Čebirić Šejla, Goasdoué François, Manolescu Ioana 2016

机译：RDF图的面向查询的汇总

Quality metrics for RDF graph summarization

摘要

著录项

相似文献

相关主题

期刊订阅