首页> 外文会议>International Conference on New Trends in Intelligent Software Methodologies, Tools and Techniques >Keyphrase Graph in Text Representation for Document Similarity Measurement
【24h】

Keyphrase Graph in Text Representation for Document Similarity Measurement

机译:文档相似度测量的文本表示中的关键词图

获取原文

摘要

To represent the text document more expressively, a kind of graph-based semantic model is proposed, in which more semantic information among keyphrases as well as the structural information of the text are incorporated. The method produces structured representations of texts by utilizing common, popular knowledge bases (e.g. DBpedia, Wikipedia) to acquire fine-grained information about concepts, entities, and their semantic relations, thus resulting in a knowledge-rich interpretation. We demonstrate the benefits of these representations in the task of document similarity measurement. Relevance evaluation between two documents is done by calculating the semantic similarity between two keyphrase graphs that represent them. Experimental results show that our approach outperforms standard baselines based on traditional document representations, and able to come close in performance to the specialized methods particularly tuned to this task on the specific dataset.
机译:为了更富有表达文本文档,提出了一种基于图形的语义模型,其中包含关键词中的更多语义信息以及文本的结构信息。 该方法通过利用共同的流行知识库(例如DBPedia,Wikipedia)来产生文本的结构化表示,以获得有关概念,实体和语义关系的细粒度信息,从而导致知识丰富的解释。 我们展示了这些陈述在文件相似度测量的任务中的好处。 通过计算代表它们的两个关键字图之间的语义相似性来完成两个文档之间的相关性评估。 实验结果表明,我们的方法超越了基于传统文档表示的标准基线,并且能够在特定数据集上特别调整到此任务的专业方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号