首页> 外文会议>ACM international conference on information and knowledge management >Theme Chronicle Model: Chronicle Consists of Timestamp and Topical Words over Each Theme
【24h】

Theme Chronicle Model: Chronicle Consists of Timestamp and Topical Words over Each Theme

机译:主题编年史模型:编年史由时间戳和每个主题上的主题词组成

获取原文

摘要

This paper presents a topic model that discovers the correlation patterns in a given time-stamped document collection and how these patterns evolve over time. Our proposal, the theme chronicle model (TCM) divides traditional topics into temporal and stable topics to detect the change of each theme over time; previous topic models ignore these differences and characterize trends as merely bursts of topics. TCM introduces a theme topic (stable topic), a trend topic (temporal topic), timestamps, and a latent switch variable in each token to realize these differences. Its topic layers allow TCM to capture not only word co-occurrence patterns in each theme, but also word co-occurrence patterns at any given time in each theme as trends. Experiments on various data sets show that the proposed model is useful as a generative model to discover fine-grained tightly coherent topics, takes advantage of previous models, and then assigns values for new documents.
机译:本文提出了一个主题模型,该模型发现了给定时间戳文档集合中的相关模式以及这些模式如何随着时间演变。我们的建议是主题编年史模型(TCM),将传统主题分为时间主题和稳定主题,以检测每个主题随时间的变化。以前的主题模型忽略了这些差异,仅将趋势描述为突发的主题。 TCM在每个令牌中引入了主题主题(稳定主题),趋势主题(时间主题),时间戳和潜在切换变量,以实现这些差异。它的主题层使TCM不仅可以捕获每个主题中的单词共现模式,还可以捕获每个主题中任意给定时间的单词共现模式作为趋势。在各种数据集上进行的实验表明,所提出的模型可以用作生成模型,以发现细粒度的紧密连贯主题,利用以前的模型,然后为新文档分配值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号