首页> 外文期刊>Knowledge and information systems >Clustering spatial data with a geographic constraint: exploring local search
【24h】

Clustering spatial data with a geographic constraint: exploring local search

机译:用地理约束对空间数据进行聚类:探索本地搜索

获取原文
获取原文并翻译 | 示例
           

摘要

Spatial data objects that possess attributes in the optimization domain and the geographic domain are now widely available. For example, sensor data are one kind of spatial data objects. The location of a sensor is an attribute in the geographic domain, while its reading is an attribute in the optimization domain. Previous studies discuss dual clustering problems that attempt to partition spatial data objects into several groups, such that objects in the same group have similar values in their optimization attributes and form a compact region in the geographic domain. However, previous studies do not clearly define compact regions. Therefore, this paper formulates a connective dual clustering problem with an explicit connected constraint given. Objects with a geographic distance smaller than or equal to the connected constraint are connected. The goal of the connective dual clustering problem is to derive clusters that contain objects with similar values in the optimization domain and are connected in the geographic domain. This study further proposes an algorithm CLS (Clustering with Local Search) to efficiently derive clusters. This algorithm consists of two phases: the ConGraph (standing for Connective Graph) transformation phase and the clustering phase. In the ConGraph transformation phase, CLS first transforms the data objects into a ConGraph that captures geographic constraints among data objects and selects initial seeds for clustering. Then, the initial seeds selected nearby data objects and formed coarse clusters by exploring local search in the clustering phase. Moreover, coarse clusters are merged and finely turned. Experiments show that CLS algorithm is more efficient and scalable than existing methods.
机译:在优化域和地理域中具有属性的空间数据对象现已广泛可用。例如,传感器数据是一种空间数据对象。传感器的位置是地理域中的一个属性,而其读数是优化域中的一个属性。先前的研究讨论了双重聚类问题,这些问题试图将空间数据对象分成几个组,以使同一组中的对象的优化属性具有相似的值,并在地理域中形成一个紧凑的区域。但是,先前的研究并未明确定义紧凑区域。因此,本文提出了具有明确连接约束的连接对偶聚类问题。连接地理距离小于或等于连接约束的对象。连通对偶聚类问题的目的是导出在优化域中包含具有相似值的对象并在地理域中连接的聚类。这项研究还提出了一种算法CLS(使用本地搜索进行聚类)以有效地导出聚类。该算法包括两个阶段:ConGraph(代表连接图)转换阶段和聚类阶段。在ConGraph转换阶段,CLS首先将数据对象转换为ConGraph,以捕获数据对象之间的地理约束并选择初始种子进行聚类。然后,初始种子通过在聚类阶段探索本地搜索来选择附近的数据对象并形成粗聚类。而且,粗簇被合并并精细地旋转。实验表明,CLS算法比现有方法具有更高的效率和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号