首页> 外文会议>IEEE International Conference on Mobile Data Management >Toward Efficient Processing of Spatio-Temporal Workloads in a Distributed In-Memory System
【24h】

Toward Efficient Processing of Spatio-Temporal Workloads in a Distributed In-Memory System

机译:在分布式内存系统中实现时空工作负载的高效处理

获取原文

摘要

Location-based services (LBS) are a widely adopted technology that produces large volumes of spatio-temporal data at high velocity. Spatial data is also being generated from many other geo-spatial applications. To address the challenge of data volume, a number of big spatial data management systems have emerged that are based on the MapReduce paradigm. Recent projects have developed spatial data systems using Spark's distributed in-memory architecture. These projects, which include GeoSpark, SpatialSpark, and LocationSpark, do not support the high update rates required by LBS applications. Alternatively, systems such as MD-HBase support data updates, but are hindered by the performance characteristics of HBase, which is a disk-oriented framework. We present DISTIL+, a distributed spatio-temporal data processing system designed for high velocity location data. Our system achieves high update throughput and low query latency by leveraging the APGAS (Asynchronous Partitioned Global Address Space) architecture to build a multi-level distributed in-memory index. We present extensive experimental evaluation of our system, comparing several indexing and data placement schemes, as well as competing systems. Our results show that DISTIL+ excels at supporting high throughput location updates, and low latency spatio-temporal range queries and kNN queries, while offering better performance than existing approaches.
机译:基于位置的服务(LBS)是一种广泛采用的技术,可在高速下产生大量的时空数据。空间数据也来自许多其他地理空间应用程序。为了解决数据量的挑战,已经出现了基于MapReduce范例的许多大空间数据管理系统。最近的项目开发了使用Spark的分布式内存架构的空间数据系统。这些项目包括Geospark,Spatialspark和LocationsPark,不支持LBS应用程序所需的高更新率。或者,诸如MD-HBase支持数据更新的系统,但是由HBase的性能特征受阻,这是一个面向磁盘的框架。我们呈现DISTLIN +,一种专为高速位置数据而设计的分布式时空数据处理系统。我们的系统通过利用APGAS(异步分区全局地址空间)架构来实现高更新吞吐量和低查询延迟,以构建多级分布式内存索引。我们对我们的系统提供了广泛的实验评估,比较了几种索引和数据放置方案以及竞争系统。我们的结果表明,Distil + Excels在支持高吞吐量位置更新和低延迟时空范围查询和KNN查询,同时提供比现有方法更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号