首页> 外文会议>Twenty-ninth International Conference on Very Large Databases; Sep 9-12, 2003; Berlin, Germany >Multiscale Histograms: Summarizing Topological Relations in Large Spatial Datasets
【24h】

Multiscale Histograms: Summarizing Topological Relations in Large Spatial Datasets

机译:多尺度直方图:总结大型空间数据集中的拓扑关系

获取原文
获取原文并翻译 | 示例

摘要

Summarizing topological relations is fundamental to many spatial applications including spatial query optimization. In this paper, we present several novel techniques to effectively construct cell density based spatial histograms for range (window) summarizations restricted to the four most important topological relations: contains, contained, overlap, and disjoint. We first present a novel framework to construct a multiscale histogram composed of multiple Euler histograms with the guarantee of the exact summarization results for aligned windows in constant time. Then we present an approximate algorithm, with the approximate ratio 19/12, to minimize the storage spaces of such multiscale Euler histograms, although the problem is generally NP-hard. To conform to a limited storage space where only k Euler histograms are allowed, an effective algorithm is presented to construct multiscale histograms to achieve high accuracy. Finally, we present a new approximate algorithm to query an Euler histogram that cannot guarantee the exact answers; it runs in constant time. Our extensive experiments against both synthetic and real world datasets demonstrated that the approximate multiscale histogram techniques may improve the accuracy of the existing techniques by several orders of magnitude while retaining the cost efficiency, and the exact multiscale histogram technique requires only a storage space linearly proportional to the number of cells for the real datasets.
机译:汇总拓扑关系是许多空间应用程序(包括空间查询优化)的基础。在本文中,我们提出了几种新颖的技术,可以有效地构建基于单元密度的空间直方图,用于范围(窗口)汇总,该汇总仅限于四个最重要的拓扑关系:包含,包含,重叠和不相交。我们首先提出一种新颖的框架,以构造由多个欧拉直方图组成的多尺度直方图,并保证在恒定时间内对齐窗口的精确汇总结果。然后,我们提出一种近似算法,其近似比率为19/12,以最小化此类多尺度Euler直方图的存储空间,尽管问题通常是NP-hard的。为了适应仅允许k个Euler直方图的有限存储空间,提出了一种有效的算法来构造多尺度直方图以实现高精度。最后,我们提出了一种新的近似算法来查询不能保证确切答案的欧拉直方图。它以恒定的时间运行。我们针对合成数据集和现实世界数据集进行的广泛实验表明,近似的多尺度直方图技术可以将现有技术的准确性提高几个数量级,同时保持成本效益,而精确的多尺度直方图技术只需要与实际数据集的像元数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号