...
首页> 外文期刊>Information retrieval >Data Driven Similarity Measures for k-Means Like Clustering Algorithms
【24h】

Data Driven Similarity Measures for k-Means Like Clustering Algorithms

机译:类似于聚类算法的k均值的数据驱动相似性度量

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We present an optimization approach that generates k-means like clustering algorithms. The batch k-means and the incremental k-means are two well known versions of the classical k-means clustering algorithm (Duda et al. 2000). To benefit from the speed of the batch version and the accuracy of the incremental version we combine the two in a "ping-pong" fashion. We use a distance-like function that combines the squared Euclidean distance with relative entropy. In the extreme cases our algorithm recovers the classical k-means clustering algorithm and generalizes the Divisive Information Theoretic clustering algorithm recently reported independently by Berkhin and Becher (2002) and Dhillonl et al. (2002). Results of numerical experiments that demonstrate the viability of our approach are reported.
机译:我们提出了一种优化方法,可以生成类似于聚类算法的k均值。批处理k均值和增量k均值是经典k均值聚类算法的两个众所周知的版本(Duda et al。2000)。为了从批处理版本的速度和增量版本的准确性中受益,我们以“乒乓”方式将两者结合在一起。我们使用类似距离的函数,将平方的欧几里得距离与相对熵结合在一起。在极端情况下,我们的算法恢复了经典的k均值聚类算法,并推广了Berkhin和Becher(2002)和Dhillonl等人最近独立报告的Divisive Information Theoretic聚类算法。 (2002)。数值实验的结果表明了我们方法的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号