首页> 外文期刊>PLoS One >Word synonym relationships for text analysis: A graph-based approach
【24h】

Word synonym relationships for text analysis: A graph-based approach

机译:文本分析的单词同义词关系:基于图形的方法

获取原文
           

摘要

Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and document categorization has led to an emphasis on graph-based keyword extraction techniques because they can capture more structural information compared to other classic text analysis methods. In this paper, we propose a simple unsupervised text mining approach that aims to extract a set of keywords from a given text and analyze its topic diversity using graph analysis tools. Initially, the text is represented as a directed graph using synonym relationships. Then, community detection and other measures are used to identify keywords in the text. The set of extracted keywords is used to assess topic diversity within the text and analyze its sentiment. The proposed approach relies on grouping semantically similar candidate words. This approach ensures that the set of extracted keywords is comprehensive. Differing from other graph-based keyword extraction approaches, the proposed method does not require user parameters during graph construction and word scoring. The proposed approach achieved significant results compared to other keyword extraction techniques.
机译:关键字提取是指及时检测给定文本中最相关的术语和表达的过程。在信息爆炸时代,关键词提取引起了越来越多的关注。关键字提取在文本摘要,文本比较和文档分类中的重要性导致了强调基于图形的关键字提取技术,因为它们可以与其他经典文本分析方法相比捕获更多结构信息。在本文中,我们提出了一种简单的无监督文本挖掘方法,旨在通过Graph分析工具来分析其主题分集的一组关键字。最初,文本用同义词关系表示为定向图。然后,使用社区检测和其他措施来识别文本中的关键字。该组提取的关键字用于评估文本中的主题分集并分析其情绪。建议的方法依赖于分组语义相似的候选词。这种方法确保了该组提取的关键字是全面的。与其他基于图形的关键字提取方法不同,所提出的方法在图形构建和Word评分期间不需要用户参数。与其他关键字提取技术相比,所提出的方法实现了显着的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号