首页> 外文期刊>Biometrika >Clustering methods based on variational analysis in the space of measures
【24h】

Clustering methods based on variational analysis in the space of measures

机译:度量空间中基于变异分析的聚类方法

获取原文
获取原文并翻译 | 示例
           

摘要

We formulate clustering as a minimisation problem in the space of measures by modelling the cluster centres as a Poisson process with unknown intensity function. We derive a Ward-type clustering criterion which, under the Poisson assumption, can easily be evaluated explicitly in terms of the intensity function. We show that asymptotically, i.e. for increasing total intensity, the optimal intensity function is proportional to a dimension-dependent power of the density of the observations. For fixed finite total intensity, no explicit solution seems available. However, the Ward-type criterion to be minimised is convex in the intensity function, so that the steepest descent method of Molchanov & Zuyev (2001) can be used to approximate the global minimum. It turns out that the gradient is similar in form to the functional to be optimised. If we discretise over a grid, the steepest descent algorithm at each iteration step increases the current intensity function at those points where the gradient is minimal at the expense of regions with a large gradient value. The algorithm is applied to a toy one-dimensional example, a simulation from a popular spatial cluster model and a real-life dataset from Strauss (1975) concerning the positions of redwood seedlings. Finally, we discuss the relative merits of our approach compared to classical hierarchical and partition clustering techniques as well as to modern model based clustering methods using Markov point processes and mixture distributions.
机译:通过将聚类中心建模为强度函数未知的泊松过程,我们将聚类公式化为度量空间中的最小化问题。我们推导了Ward型聚类标准,在Poisson假设下,可以根据强度函数轻松地对其进行显式评估。我们表明,渐近地,即为了增加总强度,最佳强度函数与观测密度的维数相关。对于固定的有限总强度,似乎没有明确的解决方案。但是,要最小化的Ward型准则在强度函数中是凸的,因此Molchanov&Zuyev(2001)的最速下降法可以用来近似全局最小值。事实证明,梯度在形式上与要优化的功能相似。如果在网格上离散化,则在每个迭代步骤中最速下降算法都会在梯度最小的那些点上增加电流强度函数,但会牺牲梯度值较大的区域。该算法被应用于玩具的一维示例,流行的空间聚类模型的仿真以及Strauss(1975)的关于红木幼苗位置的真实数据集。最后,我们讨论了与经典的层次和分区聚类技术以及使用马尔可夫点过程和混合分布的基于现代模型的聚类方法相比,该方法的相对优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号