首页> 外文会议>International conference on algorithms and architectures for parallel processing >Real-Time Data Stream Partitioning over a Sliding Window in Real-Time Spatial Big Data
【24h】

Real-Time Data Stream Partitioning over a Sliding Window in Real-Time Spatial Big Data

机译:实时空间大数据中滑动窗口上的实时数据流分区

获取原文
获取外文期刊封面目录资料

摘要

In recent years, real-time spatial applications, like location-aware services and traffic monitoring, have become more and more important. Such applications result in dynamic environments where data, as well as queries, are continuously moving. As a result, there is a tremendous amount of real-time spatial data generated every day. The growth of the data volume seems to outspeed the advance of our computing infrastructure. For instance, in real-time spatial Big Data, users expect to receive the results of each query within a short time period without holding into account the load of the system. But with a huge amount of real-time spatial data generated, the system performance degrades rapidly, especially in overload situations. To solve this problem, we propose the use of data partitioning as an optimization technique. Traditional horizontal and vertical partitioning can increase the performance of the system and simplify data management. But they remain insufficient for real-time spatial Big data; they can't deal with real-time and stream queries efficiently. Thus, in this paper, we propose a novel data partitioning approach over a sliding window in real-time spatial Big Data named VPA-RTSBD (Vertical Partitioning Approach for Real-Time Spatial Big data). This contribution is an implementation of the Matching algorithm for traditional vertical partitioning. We find, firstly, the optimal attributes sequence by the use of the Matching algorithm. Then, we propose a new cost model used for database partitioning, for keeping the data amount of each partition more balanced limit and for providing a parallel execution guarantee for the most frequent queries. VPA-RTSBD aims to obtain a real-time partitioning scheme and deals with stream data. It improves the performance of query execution by maximizing the degree of parallel execution. This affects QoS (Quality Of Service) improvement in real-time spatial Big Data especially with a huge volume of stream data. The performance of our contribution is evaluated via simulation experiments. The results show that the proposed algorithm is both efficient and scalable and that it outperforms comparable algorithms.
机译:近年来,诸如位置感知服务和交通监控之类的实时空间应用变得越来越重要。这样的应用程序会导致动态环境,其中数据以及查询在不断变化。结果,每天都会产生大量的实时空间数据。数据量的增长似乎加快了我们计算基础架构的发展速度。例如,在实时空间大数据中,用户希望在短时间内接收每个查询的结果,而无需考虑系统的负载。但是,由于生成了大量的实时空间数据,因此系统性能会迅速下降,尤其是在过载情况下。为了解决这个问题,我们建议使用数据分区作为一种优化技术。传统的水平和垂直分区可以提高系统性能并简化数据管理。但是对于实时空间大数据而言,它们仍然不足。他们无法有效地处理实时和流式查询。因此,在本文中,我们提出了一种在实时空间大数据中的滑动窗口上进行数据划分的新方法,称为VPA-RTSBD(实时空间大数据的垂直划分方法)。该贡献是用于传统垂直分区的Matching算法的一种实现。我们首先通过匹配算法找到最优属性序列。然后,我们提出了一种用于数据库分区的新成本模型,用于使每个分区的数据量保持更均衡的限制,并为最频繁的查询提供并行执行保证。 VPA-RTSBD旨在获得实时分区方案并处理流数据。它通过最大化并行执行的程度来提高查询执行的性能。这会影响实时空间大数据中QoS(服务质量)的提高,尤其是在具有大量流数据的情况下。我们的贡献的表现是通过模拟实验进行评估的。结果表明,该算法既高效又可扩展,性能优于同类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号