...
首页> 外文期刊>Intelligent data analysis >Bayesian hierarchical K-means clustering
【24h】

Bayesian hierarchical K-means clustering

机译:贝叶斯分层K-Means Clustering

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Clustering algorithm is the foundation and important technology in data mining. In fact, in the real world, the data itself often has a hierarchical structure. Hierarchical clustering aims at constructing a cluster tree, which reveals the underlying modal structure of a complex density. Due to its inherent complexity, most existing hierarchical clustering algorithms are usually designed heuristically without an explicit objective function, which limits its utilization and analysis. K-means clustering, the well-known simple yet effective algorithm which can be expressed from the view of probability distribution, has inherent connection to Mixture of Gaussians (MoG). At this point, we consider combining Bayesian theory analysis with K-means algorithm. This motivates us to develop a hierarchical clustering based on K-means under the probability distribution framework, which is different from existing hierarchical K-means algorithms processing data in a single-pass manner along with heuristic strategies. For this goal, we propose an explicit objective function for hierarchical clustering, termed as Bayesian hierarchical K-means (BHK-means). In our method, a cascaded clustering tree is constructed, in which all layers interact with each other in the network-like manner. In this cluster tree, the clustering results of each layer are influenced by the parent and child nodes. Therefore, the clustering result of each layer is dynamically improved in accordance with the global hierarchical clustering objective function. The objective function is solved using the same algorithm as K-means, the Expectation-maximization algorithm. The experimental results on both synthetic data and benchmark datasets demonstrate the effectiveness of our algorithm over the existing related ones.
机译:聚类算法是数据挖掘中的基础和重要技术。事实上,在现实世界中,数据本身通常具有分层结构。分层群集旨在构建群集树,其揭示复杂密度的底层模态结构。由于其固有的复杂性,大多数现有的分层聚类算法通常在没有明确的客观函数的情况下启发式设计,这限制了其利用和分析。 K-Means Clustering,可从概率分布的视野中表达的众所周知的简单且有效的算法,具有与高斯(MOG)混合的固有连接。此时,我们考虑将贝叶斯理论分析与K-means算法相结合。这激励我们基于概率分布框架下基于K-means的分层聚类,其与现有的分层K-Means算法不同,以单通方式以及启发式策略。对于此目标,我们提出了一个明确的客观函数,用于分层聚类,称为贝叶斯分层K-means(BHK-Means)。在我们的方法中,构建级联聚类树,其中所有层以网络类似的方式相互交互。在此群集树中,每层的聚类结果受父节点的影响。因此,根据全局分层聚类目标函数动态地改善了每层的聚类结果。目标函数使用与K-means相同的算法,期望最大化算法来解决。合成数据和基准数据集的实验结果展示了我们算法对现有相关的算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号