首页> 外文学位 >Topics in Graph Clustering
【24h】

Topics in Graph Clustering

机译:图聚类中的主题

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, two problems in social networks will be studied.;In the first part of the thesis, we focus on community recovery problems for social networks. There have been many recent theoretical advances in the model-based community recovery for network data. In the center of it are the Stochastic Block Model (SBM) and its extension, Degree Corrected Stochastic Block Model (DC-SBM). Under assumptions on the balance and separation of clusters, theoretical guarantees have been provided to ensure the recovery of the true clusters with high probability.We firstly benchmark the current recovery theorems on DC-SBM through experimental approaches. The experiments suggest that there are still lots of cases that are recoverable but not predicted by the current recovery theorems. We then introduce a wider class of network models called Preference Frame Model. We show that under weaker assumptions, the communities or clusters can be recovered by spectral clustering algorithm with essentially the same guarantees. The model-based results, despite their importance, are limited by a strong and difficult-to-verify assumption that the observed data are generated from the model. We present the model-free community recovery, where we do not make assumptions about the data generating process and provide theoretical guarantees for the performance of the model based clustering algorithms in this framework.;In the second part of the thesis, we propose a perturbation framework to measure the robustness of graph properties. Although there are already perturbation methods proposed to tackle this problem, they are limited by the fact that the strength of the perturbation cannot be well controlled. We firstly provide a perturbation framework on graphs by introducing weights on the nodes, of which the magnitude of perturbation can be easily controlled through the variance of the weights. Meanwhile, the topology of the graphs are also preserved to avoid uncontrollable strength in the perturbation. We then extend the measure of robustness in the robust statistics literature to the graph properties.
机译:本文主要研究社会网络中的两个问题。在论文的第一部分,我们着重研究社会网络中的社区恢复问题。在基于模型的网络数据社区恢复中,最近有许多理论上的进步。它的中心是随机块模型(SBM)及其扩展,即度校正随机块模型(DC-SBM)。在假设集群平衡和分离的前提下,提供了理论上的保证以确保真实集群的恢复概率很高。我们首先通过实验方法对DC-SBM上的当前恢复定理进行基准测试。实验表明,仍然有很多情况是可恢复的,但当前恢复定理无法预测这些情况。然后,我们介绍一类称为“偏好框架模型”的网络模型。我们表明,在较弱的假设下,可以通过光谱聚类算法以基本相同的保证来恢复群落或聚类。尽管基于模型的结果很重要,但受到强有力且难以验证的假设(即观察到的数据是从模型中生成)的限制。我们提出了无模型的社区恢复,在此我们不对数据生成过程做任何假设,并在此框架下为基于模型的聚类算法的性能提供了理论保证。;在论文的第二部分,我们提出了一种扰动衡量图属性鲁棒性的框架。尽管已经提出了解决该问题的摄动方法,但是它们受到不能很好地控制摄动强度的事实的限制。首先,通过在节点上引入权重,在图上提供一个扰动框架,通过权重的变化可以很容易地控制扰动的大小。同时,还保留了图的拓扑结构,以避免扰动中不可控制的强度。然后,我们将鲁棒统计资料中的鲁棒性度量扩展到图属性。

著录项

  • 作者

    Wan, Yali.;

  • 作者单位

    University of Washington.;

  • 授予单位 University of Washington.;
  • 学科 Statistics.;Mathematics.;Computer science.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 118 p.
  • 总页数 118
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号