首页> 外文期刊>Journal of geographical systems >HadoopTrajectory: a Hadoop spatiotemporal data processing extension
【24h】

HadoopTrajectory: a Hadoop spatiotemporal data processing extension

机译:Hadooptrajectory:Hadoop时空数据处理扩展

获取原文
获取原文并翻译 | 示例
           

摘要

The recent advances in location tracking technologies and the widespread use of location-aware applications have resulted in big datasets of moving object trajectories. While there exists a couple of research prototypes for moving object databases, there is a lack of systems that can process big spatiotemporal data. This work proposes HadoopTrajectory, a Hadoop extension for spatiotemporal data processing. The extension adds spatiotemporal types and operators to the Hadoop core. These types and operators can be directly used in MapReduce programs, which gives the Hadoop user the possibility to write spatiotemporal data analytics programs. The storage layer of Hadoop, the HDFS, is extended by types to represent trajectory data and their corresponding input and output functions. It is also extended by file splitters and record readers. This enables Hadoop to read big files of moving object trajectories such as vehicle GPS tracks and split them over worker nodes for distributed processing. The storage layer is also extended by spatiotemporal indexes that help filtering the data before splitting it over the worker nodes. Several data access functions are provided so that the MapReduce layer can deal with this data. The MapReduce layer is extended with trajectory processing operators, to compute for instance the length of a trajectory in meters. This paper describes the extension and evaluates it using a synthetic dataset and a real dataset. Comparisons with non-Hadoop systems and with standard Hadoop are given. The extension accounts for about 11,601 lines of Java code.
机译:位置跟踪技术的最近进步和广泛使用位置感知应用程序导致移动对象轨迹的大数据集。虽然存在一些用于移动对象数据库的研究原型,但缺乏可以处理大量时空数据的系统。这项工作提出了Hadooptrajectory,一种用于时空数据处理的Hadoop扩展。扩展将时空类型和操作员添加到Hadoop核心。这些类型和运算符可以直接用于MapReduce程序,它为Hadoop用户提供了编写时空数据分析程序的可能性。 Hadoop的存储层,HDFS,通过类型扩展,以表示轨迹数据及其相应的输入和输出功能。它还由文件分离器和记录读取器扩展。这使Hadoop能够读取移动对象轨迹的大文件,例如车辆GPS轨道,并将其分割过往用于分布式处理的工人节点。存储层也通过时空索引扩展,从而有助于在将数据拆分在工作者节点上之前过滤数据。提供了几个数据访问功能,以便MapReduce层可以处理此数据。 MapReduce层与轨迹处理运算符扩展,以计算例如以米为单位的轨迹的长度。本文介绍了扩展,并使用合成数据集和实时数据集进行评估。给出了与非Hadoop系统和标准Hadoop的比较。扩展帐户约为11,601行的Java代码行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号