首页> 外文会议>Advances in Knowledge Discovery and Data Mining; Lecture Notes in Artificial Intelligence; 4426 >Density-Based Data Clustering Algorithms for Lower Dimensions Using Space-Filling Curves
【24h】

Density-Based Data Clustering Algorithms for Lower Dimensions Using Space-Filling Curves

机译:基于空间填充曲线的低维密度数据聚类算法

获取原文
获取原文并翻译 | 示例

摘要

We present two new density-based algorithms for clustering data points in lower dimensions (dimensions ≤ 10). Both our algorithms compute density-based clusters and noises in O(n) CPU time, space, and I/O cost, under some reasonable assumptions, where n is the number of input points. Besides packing the data structure into buckets and using block access techniques to reduce the I/O cost, our algorithms apply space-filling curve techniques to reduce the disk access operations. Our first algorithm (Algorithm A) focuses on handling not highly clustered input data, while the second algorithm (Algorithm B) focuses on highly clustered input data. We implemented our algorithms, evaluated the effects of various space-filling curves, identified the best space-filling curve for our approaches, and conducted extensive performance evaluation. The experiments show the high performance of our algorithms. We believe that our algorithms are of considerable practical value.
机译:我们提出了两种基于密度的新算法,用于在较低维度(尺寸≤10)中对数据点进行聚类。在某些合理的假设下,我们的两种算法都以O(n)CPU时间,空间和I / O成本计算基于密度的群集和噪声,其中n是输入点数。除了将数据结构打包到存储桶中并使用块访问技术以减少I / O成本外,我们的算法还使用空间填充曲线技术来减少磁盘访问操作。我们的第一个算法(算法A)专注于处理非高度聚类的输入数据,而第二个算法(算法B)专注于高度聚类的输入数据。我们实施了算法,评估了各种空间填充曲线的效果,确定了适合我们方法的最佳空间填充曲线,并进行了广泛的性能评估。实验证明了我们算法的高性能。我们认为我们的算法具有相当大的实用价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号