...
首页> 外文期刊>LIPIcs : Leibniz International Proceedings in Informatics >What Makes Spatial Data Big? A Discussion on How to Partition Spatial Data
【24h】

What Makes Spatial Data Big? A Discussion on How to Partition Spatial Data

机译:是什么使空间数据变大?关于如何对空间数据进行分区的讨论

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The amount of available spatial data has significantly increased in the last years so that traditional analysis tools have become inappropriate to effectively manage them. Therefore, many attempts have been made in order to define extensions of existing MapReduce tools, such as Hadoop or Spark, with spatial capabilities in terms of data types and algorithms. Such extensions are mainly based on the partitioning techniques implemented for textual data where the dimension is given in terms of the number of occupied bytes. However, spatial data are characterized by other features which describe their dimension, such as the number of vertices or the MBR size of geometries, which greatly affect the performance of operations, like the spatial join, during data analysis. The result is that the use of traditional partitioning techniques prevents to completely exploit the benefit of the parallel execution provided by a MapReduce environment. This paper extensively analyses the problem considering the spatial join operation as use case, performing both a theoretical and an experimental analysis for it. Moreover, it provides a solution based on a different partitioning technique, which splits complex or extensive geometries. Finally, we validate the proposed solution by means of some experiments on synthetic and real datasets.
机译:过去几年中,可用空间数据的数量已大大增加,因此传统的分析工具已变得不适用于有效地对其进行管理。因此,进行了许多尝试,以定义现有MapReduce工具(例如Hadoop或Spark)的扩展,这些扩展具有就数据类型和算法而言的空间功能。此类扩展主要基于为文本数据实现的分区技术,其中根据占用的字节数指定维数。但是,空间数据的特征是描述其维数的其他特征,例如顶点的数量或几何的MBR大小,这在数据分析过程中会极大地影响像空间连接之类的操作的性能。结果是,使用传统的分区技术无法完全利用MapReduce环境提供的并行执行的好处。本文以空间连接操作为用例,对问题进行了广泛分析,并对其进行了理论和实验分析。此外,它提供了基于不同分区技术的解决方案,该技术可拆分复杂或广泛的几何形状。最后,我们通过对合成数据集和真实数据集进行一些实验来验证所提出的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号