首页> 外文会议>International conference on recent advances in natural language processing >Hierarchical Topic Structuring: From Dense Segmentation to Topically Focused Fragments via Burst Analysis
【24h】

Hierarchical Topic Structuring: From Dense Segmentation to Topically Focused Fragments via Burst Analysis

机译:分层主题结构:通过突发分析从密集分割到局部集中的片段

获取原文

摘要

Topic segmentation traditionally relies on lexical cohesion measured through word re-occurrences to output a dense segmentation, either linear or hierarchical. In this paper, a novel organization of the topical structure of textual content is proposed. Rather than searching for topic shifts to yield dense segmentation, we propose an algorithm to extract topically focused fragments organized in a hierarchical manner. This is achieved by leveraging the temporal distribution of word re-occurrences, searching for bursts, to skirt the limits imposed by a global counting of lexical reoccurrences within segments. Comparison to a reference dense segmentation on varied datasets indicates that we can achieve a better topic focus while retrieving all of the important aspects of a text.
机译:传统上,主题细分依赖于通过单词重复出现而测得的词汇衔接,以输出线性或分层的密集细分。本文提出了一种新颖的文本内容主题结构组织。我们提出了一种算法来提取以层次结构组织的局部聚焦片段,而不是搜索主题转移以产生密集的细分。这是通过利用单词重现的时间分布,搜索突发来克服由段内词汇重现的全局计数所施加的限制来实现的。与在各种数据集上进行的参考密集分割相比,表明我们可以在检索文本的所有重要方面的同时实现更好的主题重点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号