首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >On Burst Detection and Prediction in Retweeting Sequence
【24h】

On Burst Detection and Prediction in Retweeting Sequence

机译:转推序列中的突发检测和预测

获取原文

摘要

Message propagation via retweet chain can be regarded as a social contagion process. In this paper, we examine burst patterns in retweet activities. A burst is a large number of retweets of a particular tweet occurring within a certain short time window. The occurring of a burst indicates the original tweet receives abnormally high attentions during the burst period. It will be imperative to characterize burst patterns and develop algorithms to detect and predict bursts. We propose the use of the Cantelli's inequality to identify bursts from retweet sequence data. We conduct a comprehensive empirical analysis of a large microblogging dataset collected from the Sina Weibo and report our observations of burst patterns. Based on our empirical findings, we extract various features from users' profiles, fellowship topology, and message topics and investigate whether and how accurate we can predict bursts using classifiers based on the extracted features. Our empirical study of the Sina Weibo data shows the feasibility of burst prediction using appropriately extracted features and classic classifiers.
机译:通过转推链传播消息可以被视为一种社会传播过程。在本文中,我们研究了转推活动中的突发模式。突发是在某个短时间窗口内发生的特定推文的大量转发。突发事件的发生表明原始推文在突发事件期间受到异常高的关注。表征突发模式并开发检测和预测突发的算法将势在必行。我们建议使用Cantelli不等式从转推序列数据中识别突发。我们对从新浪微博收集的大型微博数据集进行了全面的经验分析,并报告了我们对突发模式的观察。基于我们的经验发现,我们从用户的个人资料,研究金拓扑和消息主题中提取了各种特征,并研究了基于提取的特征使用分类器预测突发的准确性和准确性。我们对新浪微博数据的实证研究表明,使用适当提取的特征和经典分类器进行突发预测的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号