...
首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Linguistic summaries of graph datasets using ontologies: An application to Semantic Web
【24h】

Linguistic summaries of graph datasets using ontologies: An application to Semantic Web

机译:使用本体的图形数据集的语言摘要:语义Web的应用程序

获取原文
获取原文并翻译 | 示例
           

摘要

An approach to performing linguistic summaries of graph datasets, with particular focus on usage of ontologies is presented in this paper. This well-known mining technique is based on fuzzy set theory, which is used to model natural language words (e.g. 'many', 'tall'), and in result-generates natural-like sentences describing the data. Although intensely developed, before our work this method has been applied only to relational databases, while more and more data is available in graph model. A special case of such graph datasets is the Semantic Web, in which ontologies provide meaning, therefore enabling advanced machine learning. In our paper we analyze the problem of generating linguistic summaries for a graph data case (for which the method cannot be directly applied), with associated ontologies. The key element of ontologies are concept hierarchies, which are the core of our work. Firstly, due to heterogeneity and lack of schema we propose to use an ontological concept (including all sub-concepts in hierarchy) as a subject for summaries, and extract their attributes (neighboring vertexes). Then we show that by ascending these ontological concept hierarchies (so by attribute-based induction) we obtain additional, generalized summaries. We showthis process for both summarizers and qualifiers, and propose an extension to their respective imprecision measures -T-2 and T-9. We perform two experiments on DBPedia-one for summary subject 'Artist', and second for 'Musical Album'. For the latter, we show the optimized process of obtaining the truth values using bottom-up approach.
机译:本文介绍了执行图形数据集的语言摘要的方法,特别侧重于本文的使用。这种众所周知的挖掘技术基于模糊集理论,用于模拟自然语言词语(例如'许多','高'),并且在结果中生成描述数据的自然句子。虽然在我们的工作之前强烈开发,但这种方法仅应用于关系数据库,而图形模型中可以使用越来越多的数据。这种图形数据集的特殊情况是语义Web,其中本体提供意义,因此可以实现高级机器学习。在我们的论文中,我们分析了与关联的本体中的图形数据情况(对于无法直接应用的方法)生成语言摘要的问题。本体的关键元素是概念层次结构,这是我们工作的核心。首先,由于异质性和缺乏模式,我们建议使用本体论(包括层次结构中的所有子概念)作为摘要的主题,并提取它们的属性(相邻顶点)。然后,我们通过上升这些本体论概念层次结构(通过基于属性的归纳),我们获得了额外的广义摘要。我们为摘要和限定员展示了过程,并提出了各自的不精确措施-T-2和T-9的延伸。我们在DBPedia-One对DBPedia进行了两个实验,摘要主题的“艺术家”,而第二个是“音乐专辑”。对于后者,我们展示了使用自下而上方法获得真实值的优化过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号