首页> 外文期刊>Visualization and Computer Graphics, IEEE Transactions on >How Hierarchical Topics Evolve in Large Text Corpora
【24h】

How Hierarchical Topics Evolve in Large Text Corpora

机译:大文本语料库中层次主题的演变

获取原文
获取原文并翻译 | 示例
           

摘要

Using a sequence of topic trees to organize documents is a popular way to represent hierarchical and evolving topics in text corpora. However, following evolving topics in the context of topic trees remains difficult for users. To address this issue, we present an interactive visual text analysis approach to allow users to progressively explore and analyze the complex evolutionary patterns of hierarchical topics. The key idea behind our approach is to exploit a tree cut to approximate each tree and allow users to interactively modify the tree cuts based on their interests. In particular, we propose an incremental evolutionary tree cut algorithm with the goal of balancing 1) the fitness of each tree cut and the smoothness between adjacent tree cuts; 2) the historical and new information related to user interests. A time-based visualization is designed to illustrate the evolving topics over time. To preserve the mental map, we develop a stable layout algorithm. As a result, our approach can quickly guide users to progressively gain profound insights into evolving hierarchical topics. We evaluate the effectiveness of the proposed method on Amazon's Mechanical Turk and real-world news data. The results show that users are able to successfully analyze evolving topics in text data.
机译:使用主题树序列组织文档是在文本语料库中表示层次结构和不断发展的主题的一种流行方法。但是,对于用户而言,在主题树的上下文中关注不断发展的主题仍然很困难。为了解决此问题,我们提出了一种交互式的可视化文本分析方法,使用户能够逐步探索和分析层次结构主题的复杂演化模式。我们方法背后的关键思想是利用一个树木砍伐来逼近每棵树木,并允许用户根据自己的兴趣交互式地修改树木砍伐。特别是,我们提出了一种增量进化树切割算法,其目标是平衡1)每个树形切割的适应性和相邻树形切割之间的平滑度; 2)与用户兴趣有关的历史信息和新信息。基于时间的可视化旨在说明随时间变化的主题。为了保留思维导图,我们开发了一种稳定的布局算法。因此,我们的方法可以快速指导用户逐步了解不断发展的层次主题。我们在亚马逊的Mechanical Turk和真实新闻数据上评估了该方法的有效性。结果表明,用户能够成功分析文本数据中不断发展的主题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号