Conference: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

What to Track on the Twitter Streaming API? A Knapsack Bandits Approach to Dynamically Update the Search Terms


Abstract

The Twitter Streaming API is used for many purposes, such as monitoring brands and discovering events. Because the Streaming API only allows tracking words (commonly called "search terms"), the data collection goal must be formulated in terms of search terms. Twitter limits the number of search terms that can be tracked through the API, and the number of tweets retrieved per search term depends on the terms being tracked. It is therefore crucial to track a small set of highly relevant terms. Because social media is highly dynamic and conversations evolve fast, search terms that are relevant now may become less useful in as little as an hour. Manually monitoring such discussions to update the search terms is cumbersome, error-prone, and expensive. Can an algorithm update the search terms based on the goals of the data collection? Taking inspiration from the knapsack bandits problem, which handles the trade-off between exploration (trying new search terms) and exploitation (continuing to use search terms known to be useful) when resources (network bandwidth, disk capacity, or number of search terms) are constrained, we propose a new approach to dynamically update the search terms based on the goals of the data collection.
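
The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea of balancing exploration and exploitation under resource limits, not the authors' method. It assumes a simple UCB-style score of reward per unit cost and a greedy fill of two budgets: a cap on the number of tracked terms and a cap on tweet volume. All names (TermStats, ucb_ratio, choose_terms, update), the reward definition (fraction of retrieved tweets judged relevant, in [0, 1]) and the cost definition (share of the volume budget consumed, in (0, 1]) are assumptions made for this example.

```python
import math
from dataclasses import dataclass


@dataclass
class TermStats:
    """Running statistics for one candidate search term (hypothetical structure)."""
    pulls: int = 0            # number of rounds in which the term was tracked
    reward_sum: float = 0.0   # sum of per-round rewards, each assumed in [0, 1]
    cost_sum: float = 0.0     # sum of per-round costs, each assumed in (0, 1]


def ucb_ratio(stats: TermStats, round_no: int) -> float:
    """Optimistic reward-per-cost score; untried terms score +inf to force exploration."""
    if stats.pulls == 0:
        return math.inf
    mean_reward = stats.reward_sum / stats.pulls
    mean_cost = max(stats.cost_sum / stats.pulls, 1e-9)  # guard against division by zero
    bonus = math.sqrt(2.0 * math.log(max(round_no, 2)) / stats.pulls)
    return (mean_reward + bonus) / mean_cost


def choose_terms(stats, round_no, max_terms, volume_budget):
    """Greedily pick terms by score while both budgets still have room."""
    # Ties (for example several untried terms) are broken alphabetically.
    ranked = sorted(stats, key=lambda t: (-ucb_ratio(stats[t], round_no), t))
    chosen, used = [], 0.0
    for term in ranked:
        s = stats[term]
        # Assumption: an untried term is optimistically treated as costing nothing.
        est_cost = s.cost_sum / s.pulls if s.pulls else 0.0
        if len(chosen) < max_terms and used + est_cost <= volume_budget:
            chosen.append(term)
            used += est_cost
    return chosen


def update(stats, term, reward, cost):
    """Record the reward and cost observed for a term after one tracking round."""
    s = stats[term]
    s.pulls += 1
    s.reward_sum += reward
    s.cost_sum += cost
```

In use, each round would call choose_terms to decide which terms to track, collect tweets for a while, and then call update once per tracked term with the observed reward and cost. New candidate terms can be added to the dictionary with empty statistics at any time, which causes them to be tried at the next opportunity.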