...
首页> 外文期刊>ACM Transactions on Spatial Algorithms and Systems >SWARM: Adaptive Load Balancing in Distributed Streaming Systems for Big Spatial Data
【24h】

SWARM: Adaptive Load Balancing in Distributed Streaming Systems for Big Spatial Data

机译:Swarm:用于大空间数据的分布式流系统中的自适应负载平衡

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The proliferation of GPS-enabled devices has led to the development of numerous location-based services. These services need to process massive amounts of streamed spatial data in real-time. The current scale of spatial data cannot be handled using centralized systems. This has led to the development of distributed spatial streaming systems. Existing systems are using static spatial partitioning to distribute the workload. In contrast, the real-time streamed spatial data follows non-uniform spatial distributions that are continuously changing over time. Distributed spatial streaming systems need to react to the changes in the distribution of spatial data and queries. This article introduces SWARM, a lightweight adaptivity protocol that continuously monitors the data and query workloads across the distributed processes of the spatial data streaming system and redistributes and rebalances the workloads as soon as performance bottlenecks get detected. SWARM is able to handle multiple query-execution and data-persistence models. A distributed streaming system can directly use SWARM to adaptively rebalance the system's workload among its machines with minimal changes to the original code of the underlying spatial application. Extensive experimental evaluation using real and synthetic datasets illustrate that, on average, SWARM achieves 2× improvement in throughput over a static grid partitioning that is determined based on observing a limited history of the data and query workloads. Moreover, SWARM reduces execution latency on average 4× compared with the other technique.
机译:支持GPS的设备的扩散导致了众多基于位置的服务的开发。这些服务需要在实时处理大量流式空间数据。无法使用集中系统处理当前的空间数据规模。这导致了分布式空间流系统的开发。现有系统正在使用静态空间分区来分发工作负载。相反,实时流空间数据遵循连续变化的不均匀空间分布。分布式空间流系统需要对空间数据和查询的分布的变化作出反应。本文介绍了一种轻量级适应性协议,连续监控空间数据流系统的分布式进程中的数据和查询工作负载,并在检测到性能瓶颈后重新分配和重新平衡工作负载。 Swarm能够处理多个查询执行和数据持久性模型。分布式流系统可以直接使用群体在其计算机中自适应地重新平衡系统的工作负载,其底层空间应用程序的原始代码最小。使用实际和合成数据集的广泛实验评估说明,平均而言,Swarm在基于观察数据和查询工作负载的有限历史上确定的静态网格分区吞吐量的吞吐量的提高。此外,与其他技术相比,Swarm平均减少了4倍的执行延迟。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号