...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Penalized -regression-based bicluster localization
【24h】

Penalized -regression-based bicluster localization

机译:基于惩罚的 - 基于Bicluster本地化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Biclustering (co-clustering, two-mode clustering), as one of the classical unsupervised learning meth-ods, has been applied in many different fields in recent years. Different types of biclustering methods have been developed such as probabilistic methods, two-way clustering methods, variance minimization methods, and so on. However, few regression-based methods have been proposed to the best of our knowledge. Such methods have been applied in traditional clustering, which can improve both the com-putational efficiency and the clustering accuracy. In this paper, we present a penalized regression-based method for localizing the biclusters (PRbiclust). By imposing Truncated LASSO Penalty (TLP) and group TLP terms to penalize the column vectors and the row vectors in the regression model, the structure of biclusters in the data matrix is recovered. The model is formulated as an optimization problem with nonconvex penalties, and a computationally efficient algorithm is proposed to solve it. Convergence of the algorithm is proved. To extract the biclusters from the recovered data matrix, we propose a graph-based localization method. An evaluation criterion is also proposed to measure the efficiency of bicluster localization when noise entries exist. We apply the proposed method to both simulated datasets with different setups and a real dataset. Experiments show that this method can well capture the bicluster structure, and performs better than the existing works.
机译:双聚类(co-clustering,two-mode clustering)是一种经典的无监督学习方法,近年来在许多领域得到了应用。已经发展了不同类型的双聚类方法,如概率方法、双向聚类方法、方差最小化方法等。然而,据我们所知,很少有人提出基于回归的方法。这些方法已经应用于传统聚类中,可以提高计算效率和聚类精度。在本文中,我们提出了一种基于惩罚回归的方法来定位双聚类(PRbiclust)。通过施加截断套索惩罚(TLP)和分组TLP项来惩罚回归模型中的列向量和行向量,恢复了数据矩阵中的双聚类结构。该模型被描述为一个具有非凸惩罚的优化问题,并提出了一种计算效率高的算法来解决该问题。证明了算法的收敛性。为了从恢复的数据矩阵中提取双聚类,我们提出了一种基于图的定位方法。此外,还提出了一个评估标准来衡量存在噪声输入时双聚类定位的效率。我们将所提出的方法应用于具有不同设置的模拟数据集和真实数据集。实验表明,该方法能很好地捕捉双聚类结构,性能优于现有的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号