首页> 外文会议>International conference on very large data bases >Real-time Distributed Co-Movement Pattern Detection on Streaming Trajectories
【24h】

Real-time Distributed Co-Movement Pattern Detection on Streaming Trajectories

机译:流轨迹的实时分布式协同运动模式检测

获取原文

摘要

With the widespread deployment of mobile devices with positioning capabilities, increasingly massive volumes of trajectory data are being collected that capture the movements of people and vehicles. This data enables co-movement pattern detection, which is important in applications such as trajectory compression and future-movement prediction. Existing co-movement pattern detection studies generally consider historical data and thus propose offline algorithms. However, applications such as future movement prediction need real-time processing over streaming trajectories. Thus, we investigate real-time distributed co-movement pattern detection over streaming trajectories. Existing off-line methods assume that all data is available when the processing starts. Nevertheless, in a streaming setting, unbounded data arrives in real time, making pattern detection challenging. To this end, we propose a framework based on Apache Flink, which is designed for efficient distributed streaming data processing. The framework encompasses two phases: clustering and pattern enumeration. To accelerate the clustering, we use a range join based on two-layer indexing, and provide techniques that eliminate unnecessary verifications. To perform pattern enumeration efficiently, we present two methods FBA and VBA that utilize id-based partitioning. When coupled with bit compression and candidate-based enumeration techniques, we reduce the enumeration cost from exponential to linear. Extensive experiments offer insight into the efficiency of the proposed framework and its constituent techniques compared with existing methods.
机译:随着具有定位功能的移动设备的广泛部署,正在收集越来越多的轨迹数据,以捕获人员和车辆的运动。该数据可以实现共同运动模式检测,这在诸如轨迹压缩和未来运动预测之类的应用中很重要。现有的同动模式检测研究通常考虑历史数据,因此提出了离线算法。但是,诸如未来运动预测之类的应用程序需要对流轨迹进行实时处理。因此,我们研究了在流轨迹上的实时分布式共同运动模式检测。现有的脱机方法假定处理开始时所有数据均可用。然而,在流设置中,无限制的数据是实时到达的,这使得模式检测具有挑战性。为此,我们提出了一个基于Apache Flink的框架,该框架旨在实现高效的分布式流数据处理。该框架包括两个阶段:聚类和模式枚举。为了加速群集,我们使用基于两层索引的范围联接,并提供消除不必要验证的技术。为了有效地执行模式枚举,我们提出了两种利用基于ID的分区的方法FBA和VBA。当结合位压缩和基于候选的枚举技术时,我们将枚举成本从指数级降低到线性级。与现有方法相比,大量的实验提供了对所提出的框架及其构成技术的效率的洞察力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号