首页> 外文期刊>The Computer journal >A Comparative Study of Dual-Tree Algorithms for Computing Spatial Distance Histograms
【24h】

A Comparative Study of Dual-Tree Algorithms for Computing Spatial Distance Histograms

机译:计算空间距离直方图的双树算法的比较研究

获取原文
获取原文并翻译 | 示例
           

摘要

The 2-body correlation function (2-BCF) is a group of statistical measurements that found applications in many scientific domains. One type of 2-BCF named the Spatial Distance Histogram (SDH) is of vital importance in describing the physical features of natural systems. While a naive way of computing SDH requires quadratical time, efficient algorithms based on resolving nodes in spatial trees have been developed. A key decision in the design of such algorithms is to choose a proper underlying data structure: our previous work utilizes quad-tree (oct-tree for 3D data) and, in this paper, we study a kd-tree-based solution. Although it is easy to see that both implementations have the same time complexity O(N2d-1/d), where d is the number of dimensions of the dataset, a thorough comparison of their actual running time under different scenarios is conducted. In particular, we present an analytical model to rigorously quantify the running time of dual-tree algorithms. Our analysis suggests that the kd-tree-based implementation outperforms the quad-/oct-tree solution under a wide range of data sizes and query parameters. Specifically, such performance advantage is shown as a speedup up to 1.23x over the quad-tree algorithm for 2D data, and 1.39x over the oct-tree for 3D data, respectively. Results of extensive experiments run on synthetic and real datasets confirm our findings.
机译:2体相关函数(2-BCF)是一组统计测量值,已在许多科学领域得到应用。一种称为空间距离直方图(SDH)的2-BCF类型对于描述自然系统的物理特征至关重要。尽管朴素的SDH计算方法需要二次时间,但是已经开发了基于解析空间树中节点的有效算法。设计此类算法的关键决策是选择合适的基础数据结构:我们之前的工作使用四叉树(用于3D数据的八叉树),并且在本文中,我们研究了基于kd树的解决方案。尽管很容易看出这两个实现具有相同的时间复杂度O(N2d-1 / d),其中d是数据集的维数,但在不同情况下对它们的实际运行时间进行了全面比较。特别是,我们提出了一个分析模型来严格量化双树算法的运行时间。我们的分析表明,在大量数据大小和查询参数下,基于kd树的实现优于四叉树/八叉树解决方案。具体而言,这种性能优势显示为:对于2D数据,四叉树算法的加速比分别提高了1.23倍;对于3D数据,对八叉树的速度分别提高了1.39倍。在合成和真实数据集上进行的大量实验的结果证实了我们的发现。

著录项

  • 来源
    《The Computer journal》 |2019年第1期|42-62|共21页
  • 作者单位

    Univ S Florida, Dept Comp Sci & Engn, 4202 E Fowler Ave,ENB 118, Tampa, FL 33620 USA|Univ S Florida, Interdisciplinary Data Sci Consortium, 4202 E Fowler Ave,ENB 118, Tampa, FL 33620 USA;

    Wuhan Univ Technol, Dept Math, 122 Luosi Rd, Wuhan 430070, Hubei, Peoples R China;

    Univ S Florida, Dept Comp Sci & Engn, 4202 E Fowler Ave,ENB 118, Tampa, FL 33620 USA|Univ S Florida, Interdisciplinary Data Sci Consortium, 4202 E Fowler Ave,ENB 118, Tampa, FL 33620 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    scientific databases; query processing; spatial distance histogram; performance; quad-tree; oct-tree; kd-tree;

    机译:科学数据库;查询处理;空间距离直方图;性能;四叉树;八叉树;kd树;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号