首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery >Clustering-Based Histograms for Multi-dimensional Data
【24h】

Clustering-Based Histograms for Multi-dimensional Data

机译:基于聚类的多维数据直方图

获取原文
获取外文期刊封面目录资料

摘要

A new technique for constructing multi-dimensional histograms is proposed. This technique first invokes a density-based clustering algorithm to locate dense and sparse regions of the input data. Then the data distribution inside each of these regions is summarized by partitioning it into non-overlapping blocks laid onto a grid. The granularity of this grid is chosen depending on the underlying data distribution: the more homogeneous the data, the coarser the grid. Our approach is compared with state-of-the-art histograms on both synthetic and real-life data and is shown to be more effective.
机译:提出了一种构建多维直方图的新技术。该技术首先调用基于密度的聚类算法来定位输入数据的密集和稀疏区域。然后通过将其分配到铺设到网格上的非重叠块中来概述这些区域中的每个区域中的数据分布。根据底层数据分布选择该网格的粒度:数据越多,粗略网格。我们的方法与合成和现实数据的最先进的直方图进行了比较,并且被证明更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号