首页> 外文会议>International Conference on Data Mining >Parallel Hybrid Clustering using Genetic Programming and Multi-Objective Fitness with Density (PYRAMID)
【24h】

Parallel Hybrid Clustering using Genetic Programming and Multi-Objective Fitness with Density (PYRAMID)

机译:使用遗传编程和密度的多目标适应性(金字塔)并行混合聚类

获取原文

摘要

Clustering is the process of locating patterns in large data sets. It is an active research area that provides value to scientific as well as business applications. Practical clustering faces several challenges including: identifying clusters of arbitrary shapes, sensitivity to the order of input, dynamic determination of the number of clusters, outlier handling, processing speed of massive data sets, handling higher dimensions, and dependence on user-supplied parameters. Many studies have addressed one or more of these challenges. This study proposes an algorithm called parallel hybrid clustering using genetic programming and multi-objective fitness with density (PYRAMID). While still leaving significant challenges unresolved, such as handling higher dimensions and dependence on user-supplied parameters, PYRAMID employs a combination of data parallelism, a form of genetic programming, and a multi-objective density-based fitness function in the context of clustering to resolve most of the above challenges. Preliminary experiments have yielded promising results.
机译:群集是在大数据集中定位模式的过程。它是一个活跃的研究区,可为科学和商业应用提供价值。实用聚类面临着多种挑战,包括:识别任意形状的集群,对输入顺序的敏感性,动态确定集群的数量,异常数据集的异常数据集的处理速度,处理更高的维度以及对用户提供的参数的依赖性。许多研究已经解决了其中一个或多个这些挑战。本研究提出了一种使用密度(金字塔)的遗传编程和多目标适应性称为并联混合聚类的算法。虽然仍然留下了未解决的重大挑战,例如处理更高的维度和对用户提供的参数的依赖性,但金字塔采用数据并行性,遗传编程形式的组合,以及在聚类的背景下的基于多目标密度的适应性函数解决大多数上述挑战。初步实验产生了有希望的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号