首页> 外文会议>2015 IEEE International Congress on Big Data >Nimbus: Tuning Filters Service on Tweet Streams
【24h】

Nimbus: Tuning Filters Service on Tweet Streams

机译:Nimbus:在Tweet流上优化过滤器服务

获取原文
获取原文并翻译 | 示例

摘要

With hundreds of millions of tweets being generated by Twitter users every day, tweet analysis has drawn considerable attention for event detection and trending sentiment indication. The problem is finding the few important tweets in this huge volume of traffic. A number of systems provide applications the ability to filter a complete or partial Twitter stream based on keywords and/or text properties to try to separate the relevant tweets from all of the noise. Designing a filter to produce useful results can be extremely difficult. For instance, consider the problem of finding tweets related to the Target Corporation or Guess USA. Just scanning the text of tweets for "target" or "guess" is likely to generate lots of hits, but few really relevant tweets. Nimbus is a service that can be used to tune filters on tweet streams. The Nimbus service builds a database of tweets from a Twitter stream (it does not have to be a full Twitter fire hose) and provides an API for testing filters (based on the Power Track language and Spark as evaluation engine) against the database. The important feature of Nimbus is that it allows repeatable testing of filter expressions against real Twitter data using the same filter language that can be used against live Twitter streams. This makes it possible for users of the service to tune their filters before putting them into production use.
机译:推特用户每天产生数亿条推文,推文分析已引起事件检测和趋势情绪指示的极大关注。问题是在如此庞大的流量中找到了一些重要的推文。许多系统为应用程序提供了基于关键字和/或文本属性过滤完整或部分Twitter流的功能,以尝试将相关推文与所有噪声区分开。设计滤波器以产生有用的结果可能非常困难。例如,考虑查找与Target Corporation或Guess USA相关的推文的问题。仅仅扫描推文中的“目标”或“猜测”很可能会产生很多点击,但真正相关的推文却很少。 Nimbus是一项服务,可用于调整tweet流上的过滤器。 Nimbus服务从Twitter流(不一定是完整的Twitter消防水带)构建推文数据库,并提供用于针对该数据库测试过滤器(基于Power Track语言和Spark作为评估引擎)的API。 Nimbus的重要功能是它允许使用与实时Twitter流相同的过滤器语言,对真实的Twitter数据进行过滤器表达式的可重复测试。这使得服务的用户可以在将其过滤器投入生产使用之前对其进行调整。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号