首页> 外文期刊>The VLDB journal >Top-k spatial-keyword publish/subscribe over sliding window
【24h】

Top-k spatial-keyword publish/subscribe over sliding window

机译:在滑动窗口上发布/订阅的前k个空间关键字

获取原文
获取原文并翻译 | 示例
       

摘要

With the prevalence of social media and GPS-enabled devices, a massive amount of geo-textual data have been generated in a stream fashion, leading to a variety of applications such as location-based recommendation and information dissemination. In this paper, we investigate a novel real-time top- monitoring problem over sliding window of streaming data; that is, we continuously maintain the top-k most relevant geo-textual messages (e.g., geo-tagged tweets) for a large number of spatial-keyword subscriptions (e.g., registered users interested in local events) simultaneously. To provide the most recent information under controllable memory cost, sliding window model is employed on the streaming geo-textual data. To the best of our knowledge, this is the first work to study top- spatial-keyword publish/subscribe over sliding window. A novel centralized system, called Skype (Top-k Spatial-keyword Publish/Subscribe), is proposed in this paper. In Skype, to continuously maintain top- results for massive subscriptions, we devise a novel indexing structure upon subscriptions such that each incoming message can be immediately delivered on its arrival. To reduce the expensive top- re-evaluation cost triggered by message expiration, we develop a novel cost-based k -skyband technique to reduce the number of re-evaluations in a cost-effective way. Extensive experiments verify the great efficiency and effectiveness of our proposed techniques. Furthermore, to support better scalability and higher throughput, we propose a distributed version of Skype, namely DSkype, on top of Storm, which is a popular distributed stream processing system. With the help of fine-tuned subscription/message distribution mechanisms, DSkype can achieve orders of magnitude speed-up than its centralized version.
机译:随着社交媒体和支持GPS的设备的普及,以流方式生成了大量的地理文本数据,从而导致了多种应用,例如基于位置的推荐和信息发布。在本文中,我们研究了流数据滑动窗口上的新型实时顶部监控问题。也就是说,我们会同时为大量的空间关键字订阅(例如,对本地事件感兴趣的注册用户)持续保持最相关的地理文本消息(例如,带有地理标签的推文)。为了在可控制的存储器成本下提供最新信息,在流式地理文本数据上采用了滑动窗口模型。据我们所知,这是研究在滑动窗口上发布/订阅顶级空间关键字的第一项工作。本文提出了一种新颖的集中式系统,称为Skype(Top-k空间关键字发布/订阅)。在Skype中,为了持续保持大量订阅的最佳结果,我们在订阅上设计了一种新颖的索引结构,以便每个传入消息在到达时都可以立即传递。为了减少消息过期触发的昂贵的最高重新评估成本,我们开发了一种基于成本的新颖的k-skyband技术,以经济高效的方式减少了重新评估的次数。大量的实验证明了我们提出的技术的巨大效率和有效性。此外,为了支持更好的可伸缩性和更高的吞吐量,我们在Storm之上提出了Skype的分布式版本,即DSkype,Storm是一种流行的分布式流处理系统。借助微调的订阅/消息分发机制,DSkype可以比其集中版本提高几个数量级的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号