首页> 外文期刊>ETRI journal >Data Clustering Method Using a Modified Gaussian Kernel Metric and Kernel PCA
【24h】

Data Clustering Method Using a Modified Gaussian Kernel Metric and Kernel PCA

机译:使用修改的高斯内核度量标准和内核PCA的数据聚类方法

获取原文
           

摘要

Most hyper-ellipsoidal clustering (HEC) approaches use the Mahalanobis distance as a distance metric. It has been proven that HEC, under this condition, cannot be realized since the cost function of partitional clustering is a constant. We demonstrate that HEC with a modified Gaussian kernel metric can be interpreted as a problem of finding condensed ellipsoidal clusters (with respect to the volumes and densities of the clusters) and propose a practical HEC algorithm that is able to efficiently handle clusters that are ellipsoidal in shape and that are of different size and density. We then try to refine the HEC algorithm by utilizing ellipsoids defined on the kernel feature space to deal with more complex-shaped clusters. The proposed methods lead to a significant improvement in the clustering results over K-means algorithm, fuzzy C-means algorithm, GMM-EM algorithm, and HEC algorithm based on minimum-volume ellipsoids using Mahalanobis distance.
机译:大多数超椭圆聚类(HEC)方法使用Mahalanobis距离作为距离度量。已经证明,在这种情况下,HEC无法实现,因为分区聚类的成本函数是常数。我们证明具有修改的高斯内核度量标准的HEC可以被解释为发现浓缩的椭圆形簇(相对于群集的卷和密度)并提出一种能够有效处理椭圆形的簇的实用HEC算法形状,尺寸和密度不同。然后,我们尝试通过利用内核特征空间上定义的椭圆体来优化HEC算法来处理更复杂的群集。所提出的方法导致聚类结果的显着改善,通过K均值算法,模糊C型算法,GMM-EM算法和基于Mahalanobis距离的最小容量椭圆体的HEC算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号