首页> 外文会议>IEEE International Conference on Cognitive Informatics Cognitive Computing >Hybrid Air Mass Collision Based Optimization Algorithm for Data Cluster Problems
【24h】

Hybrid Air Mass Collision Based Optimization Algorithm for Data Cluster Problems

机译:基于混合气团碰撞的数据簇优化算法

获取原文

摘要

In data mining, clustering is an important data analysis concept. It plays a vital role in extracting the useful hidden knowledge from large input datasets. This unsupervised technique partitions the input dataset into groups called clusters. The data objects mapping is done into clusters such clusters should maintain similarity between the objects within same cluster and dissimilarity between the data objects in different clusters. In this process factors like distance measuring techniques, initial conditions and criterion functions playa key role in finding optimal clusters of data. Many optimization algorithms have come into existence to resolve these types of optimization problems. But still finding optimal clusters is a big challenging task. This work presents hybrid version of the recently devised nature-inspired algorithm i.e. Tornadogenesis Optimization Algorithm (TOA) for solving data clustering problems using BB-BC. We framed this work in two phases wherein the first phase testing for optimization performance on 23 standard mathematical benchmark functions took place, in the second phase numerical ability is tested by applying hybridized Tornadogenesis Optimization Algorithm (HTOA) on 10 real-world data clustering problems. In addition to that various distance measuring techniques used to test the improvement in clustering performance. We portrayed the obtained results in tabular and graphical forms. Various analysis and comparisons have been made and found that the performance of proposed HTOA is good at solving data clustering problems using Euclidean distance measuring technique.
机译:在数据挖掘中,聚类是重要的数据分析概念。它在从大型输入数据集中提取有用的隐藏知识方面起着至关重要的作用。这种无监督技术将输入数据集划分为称为簇的组。数据对象映射已完成到群集中,这样的群集应在同一群集中的对象之间保持相似性,而在不同群集中的数据对象之间应保持相似性。在此过程中,诸如距离测量技术,初始条件和准则函数之类的因素在寻找最佳数据簇中起着关键作用。为了解决这些类型的优化问题,已经出现了许多优化算法。但是仍然要找到最佳集群是一项艰巨的任务。这项工作提出了最近设计的自然启发算法的混合版本,即用于生成遗传优化算法(TOA)的解决方案,该算法使用BB-BC解决数据聚类问题。我们将这项工作分为两个阶段,其中第一阶段针对23个标准数学基准函数进行了优化性能测试,第二阶段通过对10个现实世界数据聚类问题应用混合遗传优化算法(HTOA)测试了数字能力。除此之外,还使用了各种距离测量技术来测试聚类性能的提高。我们以表格和图形形式描绘了获得的结果。进行了各种分析和比较,发现所提出的HTOA的性能很好地解决了使用欧几里德测距技术解决的数据聚类问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号