Keyphrase Graph in Text Representation for Document Similarity Measurement

Abstract

To represent text documents more expressively, we propose a graph-based semantic model that incorporates richer semantic information among keyphrases as well as the structural information of the text. The method produces structured representations of texts by using widely available knowledge bases (e.g. DBpedia, Wikipedia) to acquire fine-grained information about concepts, entities, and their semantic relations, resulting in a knowledge-rich interpretation. We demonstrate the benefits of these representations in the task of document similarity measurement. The relevance between two documents is evaluated by calculating the semantic similarity between the two keyphrase graphs that represent them. Experimental results show that our approach outperforms standard baselines based on traditional document representations and comes close in performance to specialized methods tuned to this task on the specific dataset.
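
The abstract does not give the similarity formula, so the following is only a minimal sketch of the general idea of comparing two keyphrase graphs, not the authors' method. The `KeyphraseGraph` class, the relation label `related_to`, the weighting parameter `alpha`, and the use of exact string matching in place of knowledge-base lookups are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class KeyphraseGraph:
    """Hypothetical toy graph: weighted keyphrase nodes plus labeled relations."""
    nodes: dict[str, float] = field(default_factory=dict)          # keyphrase -> weight
    edges: set[tuple[str, str, str]] = field(default_factory=set)  # (source, relation, target)


def node_similarity(a: KeyphraseGraph, b: KeyphraseGraph) -> float:
    """Weighted Jaccard over keyphrase weights (assumption: exact string match)."""
    keys = a.nodes.keys() | b.nodes.keys()
    shared = sum(min(a.nodes.get(k, 0.0), b.nodes.get(k, 0.0)) for k in keys)
    total = sum(max(a.nodes.get(k, 0.0), b.nodes.get(k, 0.0)) for k in keys)
    return shared / total if total else 0.0


def edge_similarity(a: KeyphraseGraph, b: KeyphraseGraph) -> float:
    """Plain Jaccard over labeled relation triples."""
    union = a.edges | b.edges
    return len(a.edges & b.edges) / len(union) if union else 0.0


def graph_similarity(a: KeyphraseGraph, b: KeyphraseGraph, alpha: float = 0.7) -> float:
    """Blend node and edge overlap; alpha is an assumed tuning parameter."""
    return alpha * node_similarity(a, b) + (1 - alpha) * edge_similarity(a, b)


# Two made-up documents, reusing keyphrases from the record's keyword list.
doc_a = KeyphraseGraph(
    nodes={"graph matching": 1.0, "keyphrase extraction": 0.5},
    edges={("keyphrase extraction", "related_to", "graph matching")},
)
doc_b = KeyphraseGraph(
    nodes={"graph matching": 1.0, "document similarity": 0.5},
)

print(graph_similarity(doc_a, doc_b))
```

A model closer to the one described would replace the exact string match with semantic relatedness drawn from a knowledge base such as DBpedia, so that different keyphrases naming related concepts still contribute to the score.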

Bibliographic record

Source
International Conference on New Trends in Intelligent Software Methodologies, Tools and Techniques | 2020 | p. 478 | 14 pages
Authors
ThanhThuong T. HUYNH; TruongAn PHAMNGUYEN; Nhon V. DO
Original format: PDF
Chinese Library Classification: TP311.5-53
Keywords
Document representation; Graph-based document model; Keyphrase extraction; Document similarity; Graph matching;

Similar literature

1. Deep Text Mining for Automatic Keyphrase Extraction from Text Documents [J]. Muhammad Abulaish, Jahiruddin, Lipika Dey. Journal of Intelligent Systems. 2011, No. 4
2. Automatic Multi-Document Arabic Text Summarization Using Clustering and Keyphrase Extraction [J]. Hamzah Noori Fejer, Nazlia Omar. Journal of Artificial Intelligence. 2015, No. 1
3. A Keyphrase-Based Approach to Text Summarization for English and Bengali Documents [J]. Kamal Sarkar. International Journal of Technology Diffusion. 2014, No. 2
4. Keyphrase Graph in Text Representation for Document Similarity Measurement [C]. ThanhThuong T. HUYNH, TruongAn PHAMNGUYEN, Nhon V. DO. International Conference on New Trends in Intelligent Software Methodologies, Tools and Techniques. 2020
5. A semantic graph model for text representation and matching in document mining [D]. Shaban, Khaled. 2006
6. Assessing the Representation of Occupation Information in Free-Text Clinical Documents Across Multiple Sources [O]. Elizabeth A. Lindemann, Elizabeth S. Chen, Sripriya Rajamani
7. Comparative Analysis of N-gram Text Representation on Igbo Text Document Similarity [O]. Ifeanyi-Reuben Nkechi J., Ugwu Chidiebere, Nwachukwu E. O. 2017
