首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery(DaWaK 2006) >Calculation of Density-Based Clustering Parameters Supported with Distributed Processing
【24h】

Calculation of Density-Based Clustering Parameters Supported with Distributed Processing

机译:分布式处理支持的基于密度的聚类参数的计算

获取原文

摘要

In today's world of data-mining applications there is a strong need for processing spatial data. Spatial objects clustering is often a crucial operation in applications such as traffic-tracking systems or telemetry-oriented systems. Our current research is focused on providing an efficient caching structure for a telemetric data warehouse. We perform spatial objects clustering for every level of the structure. For this purpose we employ a density-based clustering algorithm. However efficient and scalable, the algorithm requires an user-defined parameter Eps. As we cannot get the Eps from user for every level of the structure we propose a heuristic approach for calculating the Eps parameter. Automatic Eps Calculation (AEC) algorithm analyzes pairs of points defining two quantities: distance between the points and density of the stripe between the points. In this paper we describe in detail the algorithm operation and interpretation of the results. The AEC algorithm was implemented in both centralized and distributed version. Included test results compare the two versions and verify the AEC algorithm correctness against various datasets.
机译:在今天的数据挖掘应用程序中,有强烈需要处理空间数据。空间对象聚类通常是交通跟踪系统或遥测的系统等应用中的重要操作。我们目前的研究专注于为遥测数据仓库提供有效的缓存结构。我们对结构的每个级别执行空间对象聚类。为此目的,我们使用基于密度的聚类算法。但是,算法需要高效且可扩展,需要用户定义的参数EPS。由于我们无法从用户那里获得每个结构级别的EPS,我们提出了一种用于计算EPS参数的启发式方法。自动EPS计算(AEC)算法分析定义两次的点对:点之间条带之间的点和密度之间的距离。在本文中,我们详细描述了结果的算法操作和解释。 AEC算法在集中式和分布式版本中实现。包含的测试结果比较两个版本并验证AEC算法对各种数据集的正确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号