首页> 外文会议>2nd International Conference on Information Technology and Electronic Commerce >A hot topic detection method for Chinese Microblog based on topic words
【24h】

A hot topic detection method for Chinese Microblog based on topic words

机译:基于主题词的中文微博热点话题检测方法

获取原文
获取原文并翻译 | 示例

摘要

Microblog is a kind of new network medium which sprang up quickly. Detection and tracking of hot topics through Microblog has attracted wide attentions from scholars at home and abroad in recent years. The algorithm which aims at finding topics in long text messages such as in traditional news websites and blogs, etc. can't effectively be used in disposing the Microblog data with a property of sparseness. This paper contributes a method, which aims to identify hot topics in Microblog based on the topic words. This method, throughpre-treating the Microblog data and dividing the time-window, extracts topic words in the Microblog data according to the two factors of increasing rate of word frequency and relative word frequency from Microblog data in every time-window. And then extracts and clusters the topic words according to the similarity among them, sieving for a suitable cluster of topic words so as to describe the hot topic and realize the detection of hot topic in Microblog. Through experimental verification, this method can improve the efficiency of detection to a certain extent, and raise the recall ratio and the precision ratio, so as to find hot topic in Microblog effectively and timely.
机译:微博客是一种迅速兴起的新型网络媒体。近年来,通过微博对热点话题进行检测和跟踪已经引起了国内外学者的广泛关注。旨在在诸如传统新闻网站和博客等长文本消息中查找主题的算法不能有效地用于处理具有稀疏属性的微博客数据。本文提出了一种基于主题词来识别微博中热门主题的方法。该方法通过对微博数据进行预处理和时窗划分,根据每个时间窗口的微博数据中词频和相对词频增加的两个因素,提取微博数据中的主题词。然后根据主题词之间的相似度对主题词进行提取和聚类,筛选出合适的主题词聚类,以描述热点话题,实现对微博客中热点话题的检测。通过实验验证,该方法可以在一定程度上提高检测效率,提高查全率和查准率,从而可以及时有效地在微博中找到热点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号