首页> 外文期刊>SN Applied Sciences >Nature‑inspired metaheuristic techniques for automatic clustering: a survey and performance study
【24h】

Nature‑inspired metaheuristic techniques for automatic clustering: a survey and performance study

机译:自然启发式自动启发式元启发式技术:一项调查和性能研究

获取原文
       

摘要

The application of several swarm intelligence and evolutionary metaheuristic algorithms in data clustering problems hasin the past few decades gained wide popularity and acceptance due to their success in solving and finding good qualitysolutions to a variety of complex real-world optimization problems. Clustering is considered one of the most importantdata analysis techniques in the domain of data mining. A clustering problem refers to the partitioning of unlabeleddata objects into a certain number of clusters based on their attribute values or features, with the objective of maximizingintra-clusters homogeneity and inter-cluster heterogeneity. This paper presents an up-to-date survey of majornature-inspired metaheuristic algorithms that have been employed to solve automatic clustering problems. Further, acomparative study of several modified well-known global metaheuristic algorithms is carried out to solve automaticclustering problems. Also, three hybrid swarm intelligence and evolutionary algorithms, namely, particle swarm differentialevolution algorithm, firefly differential evolution algorithm and invasive weed optimization differential evolutionalgorithm, are proposed to deal with the task of automatic data clustering. In contrast to many of the existing traditionaland evolutionary computational clustering techniques, the clustering algorithms presented in this paper do not requireany predetermined information or prior-knowledge of the dataset that is to be classified, but rather they are capable ofspontaneously identifying the optimal number of partitions of the data points during the course of program execution.Forty-one benchmarked datasets that comprise eleven artificial and thirty real world datasets are collated and utilizedto evaluate the performances of the representative nature-inspired clustering algorithms. According to the extensiveexperimental results, comparisons and statistical significance, the firefly algorithm appeared to be more appropriate forbetter clustering of both low and high dimensional data objects than were other state-of-the-art algorithms. Further,an experimental study demonstrates the superiority of the three proposed hybrid algorithms over the standard stateof-the-art methods in finding meaningful clustering solutions to the problem at hand.
机译:几种群智能和进化元启发式算法在数据聚类问题中的应用在过去的几十年中,由于他们成功地解决并找到了优质的产品而受到广泛的欢迎和认可解决各种复杂的现实世界优化问题的方法。集群被认为是最重要的集群之一数据挖掘领域的数据分析技术。群集问题是指未标记的分区根据数据对象的属性值或特征将数据对象划分为一定数量的群集,目的是最大化集群内同质性和集群间异质性。本文介绍了有关专业的最新调查受自然启发的元启发式算法已用于解决自动聚类问题。此外,进行了几种改进的著名全局元启发式算法的比较研究,以解决自动集群问题。此外,三种混合群智能和进化算法,即粒子群差分进化算法,萤火虫差异进化算法和入侵杂草优化差异进化提出了一种算法来处理自动数据聚类的任务。与许多现有的传统和进化计算聚类技术,本文提出的聚类算法不需要要分类的数据集的任何预定信息或先验知识,但是它们能够在程序执行过程中自发地确定数据点的最佳分区数。整理并利用了包括11个人工数据集和30个真实数据集的41个基准数据集评估具有代表性的自然启发式聚类算法的性能。据广泛实验结果,比较和统计意义,萤火虫算法似乎更适合与其他最新算法相比,低维和高维数据对象的聚类效果更好。进一步,实验研究表明,三种建议的混合算法在标准状态下的优越性寻找针对当前问题的有意义的聚类解决方案的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号