首页> 外文会议>International symposium on advances in spatial and temporal databases >ST-Hadoop: A MapReduce Framework for Spatio-Temporal Data
【24h】

ST-Hadoop: A MapReduce Framework for Spatio-Temporal Data

机译:ST-Hadoop:时空数据的MapReduce框架

获取原文

摘要

This paper presents ST-Hadoop; the first full-fledged open-source MapReduce framework with a native support for spatio-temporal data. ST-Hadoop is a comprehensive extension to Hadoop and Spatial-Hadoop that injects spatio-temporal data awareness inside each of their layers, mainly, language, indexing, and operations layers. In the language layer, ST-Hadoop provides built in spatio-temporal data types and operations. In the indexing layer, ST-Hadoop spatiotemporally loads and divides data across computation nodes in Hadoop Distributed File System in a way that mimics spatio-temporal index structures, which result in achieving orders of magnitude better performance than Hadoop and SpatialHadoop when dealing with spatio-temporal data and queries. In the operations layer, ST-Hadoop shipped with support for two fundamental spatio-temporal queries, namely, spatio-temporal range and join queries. Extensibility of ST-Hadoop allows others to expand features and operations easily using similar approach described in the paper. Extensive experiments conducted on large-scale dataset of size 10 TB that contains over 1 Billion spatio-temporal records, to show that ST-Hadoop achieves orders of magnitude better performance than Hadoop and SpaitalHadoop when dealing with spatio-temporal data and operations. The key idea behind the performance gained in ST-Hadoop is its ability in indexing spatio-temporal data within Hadoop Distributed File System.
机译:本文介绍了ST-Hadoop;第一个完整的开源MapReduce框架,对时空数据提供本机支持。 ST-Hadoop是Hadoop和Spatial-Hadoop的全面扩展,可将时空数据感知注入到其每个层(主要是语言,索引和操作层)中。在语言层,ST-Hadoop提供了内置的时空数据类型和操作。在索引层中,ST-Hadoop时空加载并在Hadoop分布式文件系统中的计算节点之间划分数据,并模仿时空索引结构,与时空索引结构相比,与Hadoop和SpatialHadoop相比,其性能提高了几个数量级。时态数据和查询。在操作层,ST-Hadoop附带了对两个基本时空查询的支持,即时空范围和联接查询。 ST-Hadoop的可扩展性使其他人可以使用本文中描述的类似方法轻松扩展功能和操作。在包含超过10亿个时空记录的10 TB大小的大型数据集上进行的广泛实验表明,在处理时空数据和操作时,ST-Hadoop的性能要比Hadoop和SpaitalHadoop高出几个数量级。在ST-Hadoop中获得性能背后的关键思想是它能够在Hadoop分布式文件系统中索引时空数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号