首页> 外文期刊>The VLDB journal >Performance analysis of a dual-tree algorithm for computing spatial distance histograms
【24h】

Performance analysis of a dual-tree algorithm for computing spatial distance histograms

机译:计算空间距离直方图的双树算法的性能分析

获取原文
获取原文并翻译 | 示例
           

摘要

Many scientific and engineering fields produce large volume of spatiotemporal data. The storage, retrieval, and analysis of such data impose great challenges to database systems design. Analysis of scientific spatiotemporal data often involves computing functions of all point-to-point interactions. One such analytics, the Spatial Distance Histogram (SDH), is of vital importance to scientific discovery. Recently, algorithms for efficient SDH processing in large-scale scientific databases have been proposed. These algorithms adopt a recursive tree-traversing strategy to process point-to-point distances in the visited tree nodes in batches, thus require less time when compared to the brute-force approach where all pairwise distances have to be computed. Despite the promising experimental results, the complexity of such algorithms has not been thoroughly studied. In this paper, we present an analysis of such algorithms based on a geometric modeling approach. The main technique is to transform the analysis of point counts into a problem of quan- tifying the area of regions where pairwise distances can be processed in batches by the algorithm. From the analysis, we conclude that the number of pairwise distances that are left to be processed decreases exponentially with more levels of the tree visited. This leads to the proof of a time complexity lower than the quadratic time needed for a brute-force algorithm and builds the foundation for a constant-time approximate algorithm. Our model is also general in that it works for a wide range of point spatial distributions, histogram types, and space-partitioning options in building the tree.
机译:许多科学和工程领域产生大量的时空数据。此类数据的存储,检索和分析对数据库系统设计提出了巨大挑战。科学的时空数据分析通常涉及所有点对点交互的计算功能。一种这样的分析,即空间距离直方图(SDH),对于科学发现至关重要。近来,已经提出了在大规模科学数据库中用于有效SDH处理的算法。这些算法采用递归树遍历策略来批量处理访问树节点中的点对点距离,因此与必须计算所有成对距离的蛮力方法相比,所需时间更少。尽管有令人鼓舞的实验结果,但尚未对此类算法的复杂性进行深入研究。在本文中,我们将基于几何建模方法对此类算法进行分析。主要技术是将点数的分析转换为量化该算法可以成对处理成对距离的区域面积的问题。通过分析,我们得出结论,随着访问的树的级别增加,剩下要处理的成对距离的数量呈指数下降。这导致了时间复杂度低于暴力算法所需二次时间的证明,并为恒定时间近似算法奠定了基础。我们的模型也是通用的,因为它在构建树时适用于范围广泛的点空间分布,直方图类型和空间分区选项。

著录项

  • 来源
    《The VLDB journal》 |2011年第4期|p.617-640|共24页
  • 作者单位

    Department of Mathematics, Wuhan University of Technology, 122 Luosi Road, 430070 Wuhan, Hubei, People's Republic of China;

    Department of Computer Science and Engineering, The University of South Florida, 4202 E. Fowler Ave., ENB118, Tampa, FL 33620, USA;

    Computer and Information Science Department, Indiana University-Purdue University Indianapolis, 723 W. Michigan St., SL280, Indianapolis, IN 46202, USA;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    scientific databases; correlation function; quad-tree; spatial distance histogram;

    机译:科学数据库;相关函数四叉树空间距离直方图;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号