Distributed Subtrajectory Join on Massive Datasets

PANAGIOTIS TAMPAKIS; CHRISTOS DOULKERIDIS; NIKOS PELEKIS; YANNIS THEODORIDIS

Abstract

Joining trajectory datasets is a significant operation in mobility data analytics and the cornerstone of various methods that aim to extract knowledge from them. In the era of Big Data, the production of mobility data has become massive and, consequently, performing such an operation in a centralized way is not feasible. In this article, we address the problem of Distributed Subtrajectory Join processing by utilizing the MapReduce programming model. Compared to traditional trajectory join queries, this problem is even more challenging, since the goal is to retrieve all the "maximal" portions of trajectories that are "similar." We propose three solutions: (i) a well-designed basic solution, coined DTJb; (ii) a solution that uses a preprocessing step that repartitions the data, labeled DTJr; and (iii) a solution that, additionally, employs an indexing scheme, named DTJi. In our experimental study, we utilize a 56GB dataset of real trajectories from the maritime domain, which, to the best of our knowledge, is the largest real dataset used for experimentation in the literature of trajectory data management. The results show that DTJi performs up to 16× faster than DTJb, 10× faster than DTJr, and 3× faster than the closest related state-of-the-art algorithm.
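
To make the notion of "maximal" similar portions concrete, the following is a minimal centralized sketch, not the DTJb, DTJr, or DTJi algorithms from the article. It assumes (none of this is stated in the abstract) that two trajectories are sampled at the same timestamps, that "similar" means a Euclidean distance of at most a hypothetical threshold `eps`, and that a match must span at least `min_len` consecutive samples. The function name and parameters are illustrative only.

```python
from math import hypot


def maximal_similar_runs(r, s, eps, min_len=2):
    """Return maximal index ranges (start, end), inclusive, over which
    trajectories r and s stay within distance eps of each other.

    Assumptions (illustrative, not from the article): r and s are lists
    of (x, y) points aligned on the same timestamps; eps and min_len are
    hypothetical similarity parameters.
    """
    runs = []
    start = None  # index where the current close-together run began
    for i, ((x1, y1), (x2, y2)) in enumerate(zip(r, s)):
        close = hypot(x1 - x2, y1 - y2) <= eps
        if close and start is None:
            start = i  # a new run begins here
        elif not close and start is not None:
            # The run ended at i - 1; keep it only if it is long enough.
            if i - start >= min_len:
                runs.append((start, i - 1))
            start = None
    # Close a run that extends to the last aligned sample.
    n = min(len(r), len(s))
    if start is not None and n - start >= min_len:
        runs.append((start, n - 1))
    return runs


r = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (5, 0)]
s = [(0, 0.5), (1, 0.5), (2, 5), (3, 0.2), (4, 0.1), (5, 0.3)]
print(maximal_similar_runs(r, s, eps=1.0))  # [(0, 1), (3, 5)]
```

Each reported range cannot be extended in either direction without violating the distance threshold, which is the sense of "maximal" used in this sketch. A distributed version would additionally have to handle runs that cross partition boundaries, which this example does not attempt.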

Bibliographic record

Source: ACM Transactions on Spatial Algorithms and Systems, 2020, Issue 2, pp. 8.1-8.29 (29 pages)
Authors: PANAGIOTIS TAMPAKIS; CHRISTOS DOULKERIDIS; NIKOS PELEKIS; YANNIS THEODORIDIS
Affiliation: University of Piraeus (listed for each of the four authors)
Original format: PDF
Language: English
Keywords: (Sub)Trajectory join; distributed join processing; MapReduce

