This paper presents EDA Cluster, an Estimation of Distribution Algorithm (EDA) applied to the clustering task. EDA is an Evolutionary Algorithm used here to optimize the search for adequate clusters when very little is known about the target dataset. The proposed algorithm uses a mixed approach - density and grid-based - to identify sets of dense cells in the dataset. The output is a list of items and their associated clusters. Items in low-density areas are considered noise and are not assigned to any cluster. This work uses four public domain datasets to perform the tests that compare EDA Cluster with DBSCAN, a conventional density-based clustering algorithm.
展开▼