首页> 中文期刊> 《模式识别与人工智能》 >基于热度曲线分类建模的微博热门话题预测

基于热度曲线分类建模的微博热门话题预测

         

摘要

及时掌握大众关心的热点话题是企业进行商业创新和商务营销的重要前提。现有方法大都依赖于非结构化数据的处理或反复遍历样本集,使算法复杂性较高。文中从话题的统计特性出发,提出建立在结构化数据上的非参数方法。首先对单个话题构建表征话题传播扩散程度和关注聚焦程度的热度曲线;然后对这些形态丰富的热度曲线进行分类建模,得到不同类别曲线的共性特征及发展规律;最后使用分类模型上的加权投票规则预测新话题是否会发展成为热门话题。基于新浪微博平台进行数据收集和实验,结果表明该方法数据结构简单、效果良好、复杂度低且易于控制。%Timely acquiring of hot topics is of great significance for commercial innovation and business marketing. Existing methods mostly need to cope with non-structured data or repeated traversal sample set, which results in high complexity. In this paper, emphasizing the topic statistical properties, a non-parameter method based on structured data is proposed to acquire the hot topics in time. Firstly, diffusion degree and focus degree are introduced to build heat curves to characterize the topics. Then, the varied heat curves are classified to determine the common behaviors of the topics. Finally, the weighted-vote scheme is employed to predict whether a topic is trend or not. The experimental results on Sina microblog show that the proposed method has simple data structure and works well with low time complexity and simple manipulation.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号