...
首页> 外文期刊>Geoinformatica: An international journal of advances of computer science for geographic >STHist-C: A highly accurate cluster-based histogram for two and three dimensional geographic data points
【24h】

STHist-C: A highly accurate cluster-based histogram for two and three dimensional geographic data points

机译:STHist-C:高精度的基于簇的直方图,用于二维和三维地理数据点

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Histograms have been widely used for estimating selectivity in query optimization. In this paper, we propose a new histogram construction method for geographic data objects that are used in many real-world applications. The proposed method is based on analyses and utilization of clusters of objects that exist in a given data set, to build histograms with significantly enhanced accuracy. Our philosophy in allocating the histogram buckets is to allocate them to the subspaces that properly capture object clusters. Therefore, we first propose a procedure to find the centers of object clusters. Then, we propose an algorithm to construct the histogram buckets from these centers. The buckets are initialized from the clusters' centers, then expanded to cover the clusters. Best expansion plans are chosen based on a notion of skewness gain. Results from extensive experiments using real-life data sets demonstrate that the proposed method can really improve the accuracy of the histograms further, when compared with the current state of the art histogram construction method for geographic data objects.
机译:直方图已广泛用于估计查询优化中的选择性。在本文中,我们为在许多实际应用中使用的地理数据对象提出了一种新的直方图构造方法。所提出的方法基于对给定数据集中存在的对象簇的分析和利用,以建立具有显着增强的准确性的直方图。分配直方图存储桶的理念是将它们分配给正确捕获对象簇的子空间。因此,我们首先提出一个程序来找到对象簇的中心。然后,我们提出了一种从这些中心构建直方图桶的算法。这些存储桶是从集群的中心初始化的,然后进行扩展以覆盖集群。根据偏度增益的概念选择最佳的扩展计划。使用现实生活中的数据集进行的大量实验的结果表明,与当前用于地理数据对象的最新直方图构造方法相比,该方法确实可以进一步提高直方图的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号