Conference: International Conference on Systems and Informatics

Micro-blog hot topic discovery based on string frequency


Abstract

This paper proposes a method for finding hot topics in a micro-blog text data set: select the most frequent strings, then identify the hot topics from those strings. String lengths of 1 through 8 words are considered, and the 50 most frequent strings are selected for each length, giving 400 strings in total. By browsing these 400 strings, the current hot topics in the micro-blog data set can be found. Experiments show that the 400 frequent strings can be obtained in a short time and that the hot topics of the micro-blog data set can be found from them.
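The selection step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `top_frequent_strings`, the assumption that each post is already tokenized into a list of words, and the choice to join words with a space are all hypothetical. The abstract does not say how text is segmented or how strings are counted.

```python
from collections import Counter


def top_frequent_strings(posts, max_len=8, top_k=50):
    """Return {n: [(string, count), ...]} for string lengths n = 1..max_len.

    `posts` is assumed to be an iterable of token lists (one list per post);
    tokenization itself is outside this sketch.
    """
    posts = list(posts)  # allow several passes even if given a generator
    result = {}
    for n in range(1, max_len + 1):
        counts = Counter()
        for tokens in posts:
            # Slide a window of n consecutive words over the post.
            for i in range(len(tokens) - n + 1):
                counts[" ".join(tokens[i:i + n])] += 1
        # Keep only the top_k most frequent strings of this length.
        result[n] = counts.most_common(top_k)
    return result
```

With the default parameters this produces up to 8 × 50 = 400 candidate strings, matching the count given in the abstract. Identifying hot topics from those candidates is left to a human reader, as the abstract describes.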
