【24h】

Trending Topics Rank Prediction

机译:趋势主题排名预测

获取原文

摘要

Many web services, such as Twitter and Google, provide a list of their most popular terms, called a trending topics list, in descending order of popularity ranking. The changes in people’s interest in a specific trending topic are reflected in the changes of its popularity rank (up, down, and unchanged). This paper analyses the nature of trending topics and proposes a temporal modelling framework for predicting rank change of trending topics using historical rank data. Historical rank data show that almost 70% of trending topics tend to disappear and reappear later. Therefore it is important to reflect this phenomenon in the prediction model, which is related to handling missing value and window size. Missing value handling approach was selected by using expectation maximization. An optimal window size is selected based on the minimum length of topic disappearance in the same topic but with a different context. We examined our approach with four machine-learning techniques using the U.S. twitter trending topics collected from 30th June 2012 to 30th June 2014. Our model achieved the highest prediction accuracy (94.01 %) with C4.5 decision tree algorithm.
机译:许多Web服务,例如Twitter和Google,提供了他们最受欢迎的术语,称为趋势主题列表,以普及排名的降序。人们对特定趋势主题的兴趣的变化反映在其受欢迎程度(上,下降和不变)的变化中。本文分析了趋势主题的性质,提出了一种使用历史等级数据预测趋势主题的等级变更的时间建模框架。历史排名数据显示,近70%的趋势主题往往会消失和重新出现。因此,重要的是反映在预测模型中的这种现象,这与处理缺失值和窗口大小有关。使用期望最大化选择缺失的值处理方法。根据同一主题中的最小主题消失但具有不同的上下文,选择最佳窗口大小。我们通过从2012年6月30日至2014年6月30日收集的U.S. Twitter Trending主题,通过了四种机器学习技术进行了探讨了我们的方法。我们的模型实现了最高的预测准确性(94.01%),具有C4.5决策树算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号