首页> 外文会议>IEEE Data Science Workshop >Splitting Methods For Convex Bi-Clustering And Co-Clustering
【24h】

Splitting Methods For Convex Bi-Clustering And Co-Clustering

机译:凸双聚类和共聚的拆分方法

获取原文

摘要

Co-Clustering, the problem of simultaneously identifying clusters across multiple aspects of a data set, is a natural generalization of clustering to higher-order structured data. Recent convex formulations of bi-clustering and tensor co-clustering, which shrink estimated centroids together using a convex fusion penalty, allow for global optimality guarantees and precise theoretical analysis, but their computational properties have been less well studied. In this note, we present three efficient operator-splitting methods for the convex co-clustering problem: a standard two-block ADMM, a Generalized ADMM which avoids an expensive tensor Sylvester equation in the primal update, and a three-block ADMM based on the operator splitting scheme of Davis and Yin. Theoretical complexity analysis suggests, and experimental evidence confirms, that the Generalized ADMM is far more efficient for large problems.
机译:共聚是在数据集的多个方面同时识别聚类的问题,是聚类到高阶结构化数据的自然概括。最近的双聚类和张量共聚的凸公式,通过使用凸融合罚分将估计的质心收缩在一起,可以提供全局最优性保证和精确的理论分析,但是对它们的计算特性的研究较少。在本说明中,我们提出了三种有效的凸共聚问题的算子分解方法:标准的两块式ADMM,在原始更新中避免使用昂贵的张量Sylvester方程的广义ADMM以及基于的三块式ADMM戴维斯和尹的运营商拆分方案。理论复杂性分析表明,实验证据证实,通用ADMM对于大问题的效率要高得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号