首页> 外文会议>International Conference on Big Data Computing, Applications and Technologies >CoS-HDFS: Co-Locating Geo-Distributed Spatial Data in Hadoop Distributed File System
【24h】

CoS-HDFS: Co-Locating Geo-Distributed Spatial Data in Hadoop Distributed File System

机译:CoS-HDFS:在Hadoop分布式文件系统中共同定位地理分布的空间数据

获取原文
获取原文并翻译 | 示例

摘要

Given the recent advancement in the ubiquitous positioning technologies, it is now common to query terabytes of spatial data. These massive data are usually geo-distributed across multiple data centers to ensure their availability. Yet, at least one replica of the data is stored close to where the data are generated. Spatial queries are complex and computationally intensive, and therefore, distributed computation platforms, such as Hadoop are now used to improve their execution time. However, Hadoop is agnostic to the spatial data characteristics, and it randomly partitions and locates the data stored in its distributed file system which degrades the performance of the execution of spatial queries. In this paper, we propose CoS-HDFS, an extension to the Hadoop Distributed File System (HDFS) that takes into account the spatial characteristics of the data and accordingly co-locates them on the HDFS nodes that span multiple data centers. We integrate CoS-HDFS with SpatialHadoop, a MapReduce framework that natively supports spatial data, to make use of its implementation of spatial indexes, operations, and query interfaces. We experimentally demonstrate significant reduction in the network usage and total execution time in the case of spatial join queries on the TIGER dataset.
机译:鉴于无处不在的定位技术的最新发展,现在查询数TB的空间数据非常普遍。通常将这些海量数据地理分布在多个数据中心中,以确保其可用性。但是,数据的至少一个副本存储在生成数据的位置附近。空间查询非常复杂且计算量很大,因此,现在使用分布式计算平台(例如Hadoop)来缩短其执行时间。但是,Hadoop与空间数据特性无关,它会随机分区和定位存储在其分布式文件系统中的数据,这会降低空间查询执行的性能。在本文中,我们提出了CoS-HDFS,它是Hadoop分布式文件系统(HDFS)的扩展,它考虑了数据的空间特征,因此将它们共同定位在跨越多个数据中心的HDFS节点上。我们将CoS-HDFS与SpatialHadoop(一个本机支持空间数据的MapReduce框架)集成在一起,以利用其对空间索引,操作和查询接口的实现。我们通过实验证明,在TIGER数据集上进行空间联接查询时,网络使用率和总执行时间显着减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号