...
【24h】

ST-Hadoop: a MapReduce framework for spatio-temporal data

机译:St-Hadoop:Spatio-Tempyal Data的MapReduce框架

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents ST-Hadoop; the first full-fledged open-source MapReduce framework with a native support for spatio-temporal data. ST-Hadoop is a comprehensive extension to Hadoop and SpatialHadoop that injects spatio-temporal data awareness inside each of their layers, mainly, language, indexing, and operations layers. In the language layer, ST-Hadoop provides built in spatio-temporal data types and operations. In the indexing layer, ST-Hadoop spatiotemporally loads and divides data across computation nodes in Hadoop Distributed File System in a way that mimics spatio-temporal index structures, which result in achieving orders of magnitude better performance than Hadoop and SpatialHadoop when dealing with spatio-temporal data and queries. In the operations layer, ST-Hadoop shipped with support for three fundamental spatio-temporal queries, namely, spatio-temporal range, top-k nearest neighbor, and join queries. Extensibility of ST-Hadoop allows others to extend features and operations easily using similar approaches described in the paper. Extensive experiments conducted on large-scale dataset of size 10 TB that contains over 1 Billion spatio-temporal records, to show that ST-Hadoop achieves orders of magnitude better performance than Hadoop and SpaitalHadoop when dealing with spatio-temporal data and operations. The key idea behind the performance gained in ST-Hadoop is its ability in indexing spatio-temporal data within Hadoop Distributed File System.
机译:本文介绍了St-Hadoop;第一个全面开源MapReduce框架,具有用于时空数据的本机支持。 St-Hadoop是Hadoop和SpatialHadoop的全面扩展,它在每个层内注入了时空数据感知,主要是语言,索引和操作层。在语言层中,St-Hadoop提供内置的时空数据类型和操作。在索引层中,ST-Hadoop Spatibemporallemporally负载并划分Hadoop分布式文件系统中的计算节点的数据,以模仿时空索引结构的方式,这导致在处理Spatio时比Hadoop和SpatialHadoop成绩更好的性能。时间数据和查询。在操作层中,St-Hadoop附带了三个基本的时空查询,即时空范围,Top-K最近邻居和加入查询。 ST-Hadoop的可扩展性允许其他人使用纸张中描述的类似方法轻松扩展特征和操作。在大小10 TB的大规模数据集上进行的广泛实验,其中包含超过10亿的时空记录,表明St-Hadoop在处理时空数据和操作时比Hadoop和Spaitalhadoop实现更好的性能。 ST-Hadoop中获得的性能背后的关键想法是它在Hadoop分布式文件系统中索引时空数据的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号