首页> 外文会议>IEEE International Congress on Big Data >Nimbus: Tuning Filters Service on Tweet Streams
【24h】

Nimbus: Tuning Filters Service on Tweet Streams

机译:nimbus:调整过滤器在推文流上的服务

获取原文

摘要

With hundreds of millions of tweets being generated by Twitter users every day, tweet analysis has drawn considerable attention for event detection and trending sentiment indication. The problem is finding the few important tweets in this huge volume of traffic. A number of systems provide applications the ability to filter a complete or partial Twitter stream based on keywords and/or text properties to try to separate the relevant tweets from all of the noise. Designing a filter to produce useful results can be extremely difficult. For instance, consider the problem of finding tweets related to the Target Corporation or Guess USA. Just scanning the text of tweets for "target" or "guess" is likely to generate lots of hits, but few really relevant tweets. Nimbus is a service that can be used to tune filters on tweet streams. The Nimbus service builds a database of tweets from a Twitter stream (it does not have to be a full Twitter fire hose) and provides an API for testing filters (based on the Power Track language and Spark as evaluation engine) against the database. The important feature of Nimbus is that it allows repeatable testing of filter expressions against real Twitter data using the same filter language that can be used against live Twitter streams. This makes it possible for users of the service to tune their filters before putting them into production use.
机译:每天推特用户产生数亿推文,推文分析对事件检测和趋势情绪指示造成了相当大的关注。问题在于在这一大量交通中找到了很少的重要推文。许多系统提供应用程序基于关键字和/或文本属性来过滤完整或部分Twitter流,以尝试将相关推文与所有噪声分开。设计过滤器以产生有用的结果可能是非常困难的。例如,考虑找到与目标公司或猜测USA相关的推文的问题。只需扫描“目标”或“猜测”的推文文本可能会产生很多点击,但很少有关的推文。 Nimbus是一项服务,可用于调整推文流中的过滤器。 NimBus服务从Twitter流构建了一条推文数据库(它不必是完整的Twitter Fire软管),并为数据库提供用于测试过滤器的API(基于电源轨道语言和火花作为评估引擎)。 NimBus的重要特征是它允许使用与现场推特流的相同滤波器语言相反的滤波器语言来重复测试滤波器表达式。这使得服务的用户可以在将它们放入生产使用之前调整其过滤器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号