Visualization methods for single documents are either too simple, considering word frequency only, or depend on syntactic and semantic information bases to be more useful. This paper presents an intermediary approach, based on H. P. Luhn’s automatic abstract creation algorithm, and intends to aggregate more information to document visualization than word counting methods do without the need of external sources. The method takes pairs of relevant words and computes the linkage force between them. Relevant words become vertices and links become edges in the resulting graph.
展开▼
机译:单个文档的可视化方法要么太简单,仅考虑单词频率,要么依赖语法和语义信息库才更有用。本文提出了一种基于H. P. Luhn的自动摘要创建算法的中介方法,该方法旨在比单词计数方法不需要外部资源的情况下,将更多信息聚合到文档可视化中。该方法采用相关单词对,并计算它们之间的链接力。相关单词成为顶点,链接变为结果图中的边。
展开▼