首页> 外文会议>International Conference on Computational Statistics >Three-way asymmetric hierarchical clustering based on regularized similarity models
【24h】

Three-way asymmetric hierarchical clustering based on regularized similarity models

机译:基于正则化相似模型的三通非对称层次聚类

获取原文

摘要

Three-way two-mode asymmetric data are observed in various situations such as brand switching, psychological research, and web mining. When clustering algorithms are applied to such data, several problems occur. One problem involves dealing with asymmetries. For two-way asymmetric data, there are two approaches to deal with asymmetries when using clustering algorithms. The first approach is to convert asymmetric similarities to symmetric similarities. The other approach is to introduce objective functions that consider internal variations of each cluster. However, for these clustering algorithms, it is difficult to understand the asymmetric features of the clustering results. The second problem involves determining the effects of occasions. Sato and Jain (2006) proposed fuzzy clustering for three-way two-mode asymmetric data and considered the effects of occasions[9]. In this paper, we propose two types of regularized similarity models and three-way asymmetric hierarchical clustering using entropy regularization. One regularized similarity model can provide us with factors of the direction of asymmetries, while the other model can provide us with factors comprising symmetric and asymmetric parts of asymmetric data. In addition, we introduce the factors of occasions using entropy regularization. Therefore, an advantage of the proposed algorithm is that researchers can easily interpret the clustering results.
机译:在品牌交换,心理研究和网上采矿等各种情况下观察到三元两样不对称数据。当聚类算法应用于此类数据时,发生了几个问题。一个问题涉及处理不对称。对于双向不对称数据,使用聚类算法时有两种方法可以处理不对称。第一种方法是将不对称的相似性转换为对称相似之处。另一种方法是引入考虑每个群集的内部变体的客观函数。但是,对于这些聚类算法,难以理解聚类结果的不对称特征。第二个问题涉及确定场合的效果。佐藤和耆那教徒(2006)提出了三元两样不对称数据的模糊聚类,并考虑了场合的影响[9]。在本文中,我们提出了两种类型的正则化相似模型和三通非对称分层聚类,使用熵正则化。一个正则化相似之处可以为我们提供不对称方向的因素,而另一个模型可以为我们提供包括不对称数据的对称和不对称部分的因素。此外,我们还使用熵正则化介绍了场合的因素。因此,所提出的算法的优点是研究人员可以容易地解释聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号