首页> 外文学位 >Semidefinite Programming Approaches to Network Clustering and Smoothing
【24h】

Semidefinite Programming Approaches to Network Clustering and Smoothing

机译:网络聚类和平滑的半定规划方法

获取原文
获取原文并翻译 | 示例

摘要

Community detection and link prediction are two important problems in network analysis. In this dissertation, we propose semidefinite programming approaches to network community detection and edge probabilities estimation. Interestingly, despite different motivations, it turns out that the proposed approaches to the two problems are based on the same semidefinite program (SDP) with appropriately chosen input matrices. Our SDP relies on a semidefinite relaxation of the set of normalized clustering matrices. We derive optimality conditions of the SDP from a geometric point of view and deal with a convex body, called the Fantope. This enables us to analyze the SDP by making use of the properties of the Fantope. We design an efficient alternating direction method of multipliers algorithm to solve the SDP, which only requires computation of a small number of leading eigenvectors of symmetric matrices.;For community detection, the SDP is derived from the partition criterion of maximizing the sum of average intra-cluster similarities over all clusters. The feasible set of our SDP is contained in the Fantope, which enables us to connect our SDP to sparse PCA and spectral clustering. Unlike previously proposed SDPs for community detection which use the adjacency matrix as the input matrix, we also consider using the symmetric normalized graph Laplacian matrix and the squared adjacency matrix as the input matrix to the SDP. The former choice of the input matrix is motivated by the connection between our SDP and spectral clustering and the latter builds an interesting connection between network clustering and multivariate data clustering. Using optimality conditions, we analyze the population consistency of the SDP using different input matrices under various network models. We show the exact recovery property of our SDP under strongly assortative stochastic block models when the input matrix is chosen to be the adjacency matrix. For network edge probability estimation, the SDP is derived from a least squares criterion. Since the solution to our SDP is a symmetric doubly stochastic matrix, it can be used as a smoother matrix to smooth the adjacency matrix. The estimated edge probability matrix is the smoothed adjacency matrix. We also connect our SDP-based method of estimating edge probabilities to a recently proposed neighborhood smoothing method. Our numerical experiments show that the proposed SDP has good empirical performances for community detection, edge probability estimation, and link prediction and it outperforms other state-of-the-art methods on Facebook collegiate networks for both community detection and link prediction.
机译:社区检测和链接预测是网络分析中的两个重要问题。本文提出了一种半定规划的网络社区检测和边缘概率估计方法。有趣的是,尽管动机不同,但事实证明,针对两个问题的建议方法是基于相同的半确定程序(SDP),并具有适当选择的输入矩阵。我们的SDP依赖于标准化聚类矩阵集的半确定松弛。我们从几何角度得出SDP的最优条件,并处理一个称为Fantope的凸体。这使我们能够利用Fantope的属性来分析SDP。我们设计了一种高效的乘数交替方向算法来求解SDP,它只需要计算少量对称矩阵的前导特征向量即可;对于社区检测,SDP是从最大化平均内部和的划分准则中得出的-所有集群上的集群相似性。 SDP的可行集包含在Fantope中,这使我们能够将SDP连接到稀疏PCA和频谱聚类。与先前使用邻接矩阵作为输入矩阵的用于社区检测的SDP不同,我们还考虑使用对称归一化图拉普拉斯矩阵和平方邻接矩阵作为SDP的输入矩阵。输入矩阵的前一种选择是由我们的SDP和频谱聚类之间的联系所激发的,而后者则在网络聚类和多元数据聚类之间建立了有趣的联系。使用最优条件,我们在各种网络模型下使用不同的输入矩阵来分析SDP的总体一致性。当选择输入矩阵作为邻接矩阵时,我们显示了在强分类随机块模型下SDP的确切恢复特性。对于网络边缘概率估计,从最小二乘准则得出SDP。由于我们SDP的解决方案是对称双随机矩阵,因此它可以用作平滑矩阵来平滑邻接矩阵。估计的边缘概率矩阵是平滑的邻接矩阵。我们还将估计边缘概率的基于SDP的方法与最近提出的邻域平滑方法相连接。我们的数值实验表明,所提出的SDP在社区检测,边缘概率估计和链接预测方面具有良好的经验性能,并且优于Facebook大学网络上用于社区检测和链接预测的其他最新方法。

著录项

  • 作者

    Yan, Zhifei.;

  • 作者单位

    The Ohio State University.;

  • 授予单位 The Ohio State University.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 111 p.
  • 总页数 111
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号