首页> 外文期刊>Pattern Analysis and Applications >Two density-based k-means initialization algorithms for non-metric data clustering
【24h】

Two density-based k-means initialization algorithms for non-metric data clustering

机译:非度量数据聚类的两种基于密度的k均值初始化算法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a density-based clusters' representatives selection algorithm that identifies the most central patterns from the dense regions in the dataset. The method, which has been implemented using two different strategies, is applicable to input spaces with no trivial geometry. Our approach exploits a probability density function built through the Parzen estimator, which relies on a (not necessarily metric) dissimilarity measure. Being a representatives extractor a general-purpose algorithm, our method is obviously applicable in different contexts. However, to test the proposed procedure, we specifically consider the problem of initializing the k-means algorithm. We face problems defined on standard real-valued vectors, labeled graphs, and finally sequences of real-valued vectors and sequences of characters. The obtained results demonstrate the effectiveness of the proposed representative selection method with respect to other state-of-the-art solutions.
机译:在本文中,我们提出了一种基于密度的聚类代表选择算法,该算法从数据集中的密集区域中识别出最中心的模式。该方法已使用两种不同的策略实施,适用于没有平凡几何形状的输入空间。我们的方法利用了通过Parzen估计器建立的概率密度函数,该函数依赖于(不一定是度量)不相似性度量。作为一种通用算法的代表提取器,我们的方法显然适用于不同的环境。但是,为了测试建议的过程,我们特别考虑了初始化k-means算法的问题。我们面临的问题是在标准实值向量,带标签的图以及最终的实值向量序列和字符序列上定义的问题。获得的结果证明了所提出的代表性选择方法相对于其他最新解决方案的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号