IEEE International Conference on Data Engineering

Distributed Publish/Subscribe Query Processing on the Spatio-Textual Data Stream



Abstract

A huge amount of data carrying both spatial and textual information, e.g., geo-tagged tweets, is flooding the Internet. Such a spatio-textual data stream contains valuable information for millions of users with varied interests in different keywords and locations. Publish/subscribe systems enable efficient and effective information distribution by allowing users to register continuous queries with both spatial and textual constraints. However, the explosive growth of data scale and user base poses challenges to existing centralized publish/subscribe systems for spatio-textual data streams. In this paper, we propose a distributed publish/subscribe system, called PS²Stream, which digests a massive spatio-textual data stream and directs the stream to target users with registered interests. Compared with existing systems, PS²Stream achieves a better workload distribution, both minimizing the total amount of workload and balancing the load across workers. To achieve this, we propose a new workload distribution algorithm that considers both the spatial and the textual properties of the data. Additionally, PS²Stream supports dynamic load adjustments to adapt to changes in the workload. Extensive empirical evaluation on a commercial cloud computing platform with real data validates the superiority of our system design and the advantages of our techniques for improving system performance.
