首页> 外文会议>International Conference on Machine Learning and Cybernetics >Detecting hot topics in technology news streams
【24h】

Detecting hot topics in technology news streams

机译:检测技术新闻流中的热门话题

获取原文
获取外文期刊封面目录资料

摘要

Detecting hot topics with a fine granularity in technology news streams is an interesting and important problem given the large amount of reports and a relatively narrow range of topics. In this paper, a three-phase method is proposed. In the first phase, the document topic distribution vector is generated and keywords are extracted for each document using topic model pachinko allocation. In the second phase, the documents are clustered based on the document topic distribution vector obtained from the previous phase using affinity propagation. And in the last phase, actual events denoted by combinations of keywords within each cluster are found out using frequent pattern mining algorithms. We evaluate our approach on a collection of technology news reports from various sites in a fixed time period. T he results show that this method is effective.
机译:鉴于大量报告和相对较窄的主题范围,检测具有细粒度的热门话题是一个有趣而重要的问题。本文提出了一种三相方法。在第一阶段中,生成文档主题分布向量,并且使用主题模型Pachinko分配对每个文档提取关键字。在第二阶段中,基于使用亲和传播从先前阶段获得的文档主题分布向量群集文档。并且在最后一段中,使用频繁的模式挖掘算法发现每个集群内的关键字组合表示的实际事件。我们在一个固定时间段的各种网站的一系列技术新闻报告中评估了我们的方法。结果表明,这种方法是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号