【24h】

Clustering using a genetic fuzzy least median of squares algorithm

机译:使用遗传模糊最小二乘算法进行聚类

获取原文

摘要

Reliable clustering of a severely contaminated data must depend on robust estimation methods to determine the cluster prototypes. The Least Median of Squares (LMedS) can estimate the parameters of a single prototype with a 50% breakdown point. However this breakdown point cannot be achieved when a data set consists of multiple clusters. In addition to this limitation, the objective function of the LMedS is neither amenable to analytical optimization nor to numerical optimization because of its nondifferentiability. Therefore, a tedious and time-consuming random sampling process is usually performed to search the solution space. In this paper, we first generalize the LMedS to allow the simultaneous estimation of multiple prototypes. Then we propose the use of fuzzy memberships to make this method suitable for more complex data sets. Finally, we use a genetic algorithm to provide a fast and reliable optimization of the proposed objective functions.
机译:严重污染数据的可靠聚类必须依赖可靠的估计方法来确定聚类原型。最小二乘平方(LMedS)可以估计击穿点为50%的单个原型的参数。但是,当数据集包含多个群集时,无法实现此故障点。除此限制外,LMedS的目标函数由于不具有可微性,因此既不适合分析优化也不适合数值优化。因此,通常执行繁琐且耗时的随机采样过程来搜索解空间。在本文中,我们首先对LMedS进行泛化,以允许同时估计多个原型。然后,我们提出使用模糊隶属度使该方法适用于更复杂的数据集。最后,我们使用遗传算法为提出的目标函数提供快速可靠的优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号