首页> 外文期刊>Geo-Spatial Information Science >DCAD: a Dual Clustering Algorithm for Distributed Spatial Databases
【24h】

DCAD: a Dual Clustering Algorithm for Distributed Spatial Databases

机译:DCAD:分布式空间数据库的双重群集算法

获取原文
获取原文并翻译 | 示例
           

摘要

Spatial objects have two types of attributes: geometrical attributes and non-geometrical attributes, which belong to two different attribute domains (geometrical and non-geometrical domains). Although geometrically scattered in a geometrical domain, spatial objects may be similar to each other in a non-geometrical domain. Most existing clustering algorithms group spatial datasets into different compact regions in a geometrical domain without considering the aspect of a non-geometrical domain. However, many application scenarios require clustering results in which a cluster has not only high proximity in a geometrical domain, but also high similarity in a non-geometrical domain. This means constraints are imposed on the clustering goal from both geometrical and non-geometrical domains simultaneously. Such a clustering problem is called dual clustering. As distributed clustering applications become more and more popular, it is necessary to tackle the dual clustering problem in distributed databases. The DCAD algorithm is proposed to solve this problem. DCAD consists of two levels of clustering: local clustering and global clustering. First, clustering is conducted at each local site with a local clustering algorithm, and the features of local clusters are extracted. Second, local features from each site are sent to a central site where global clustering is obtained based on those features. Experiments on both artificial and real spatial datasets show that DCAD is effective and efficient.
机译:空间对象具有两种类型的属性:几何属性和非几何属性,它们属于两个不同的属性域(几何域和非几何域)。尽管几何上分散在几何域中,但空间对象在非几何域中可能彼此相似。大多数现有的聚类算法将空间数据集在几何域中划分为不同的紧凑区域,而无需考虑非几何域的方面。但是,许多应用场景需要聚类结果,其中聚类不仅在几何域中具有很高的邻近度,而且在非几何域中也具有很高的相似性。这意味着同时从几何和非几何领域对聚类目标施加了约束。这样的聚类问题称为双重聚类。随着分布式集群应用程序变得越来越流行,有必要解决分布式数据库中的双重集群问题。提出了DCAD算法来解决这个问题。 DCAD包含两个级别的集群:本地集群和全局集群。首先,使用局部聚类算法在每个本地站点进行聚类,并提取局部聚类的特​​征。其次,将每个站点的本地功能发送到中央站点,在该中心基于这些功能获得全局聚类。在人造和真实空间数据集上的实验表明,DCAD是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号