Topics in Graph Clustering

机译：图聚类中的主题

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this thesis, two problems in social networks will be studied.;In the first part of the thesis, we focus on community recovery problems for social networks. There have been many recent theoretical advances in the model-based community recovery for network data. In the center of it are the Stochastic Block Model (SBM) and its extension, Degree Corrected Stochastic Block Model (DC-SBM). Under assumptions on the balance and separation of clusters, theoretical guarantees have been provided to ensure the recovery of the true clusters with high probability.We firstly benchmark the current recovery theorems on DC-SBM through experimental approaches. The experiments suggest that there are still lots of cases that are recoverable but not predicted by the current recovery theorems. We then introduce a wider class of network models called Preference Frame Model. We show that under weaker assumptions, the communities or clusters can be recovered by spectral clustering algorithm with essentially the same guarantees. The model-based results, despite their importance, are limited by a strong and difficult-to-verify assumption that the observed data are generated from the model. We present the model-free community recovery, where we do not make assumptions about the data generating process and provide theoretical guarantees for the performance of the model based clustering algorithms in this framework.;In the second part of the thesis, we propose a perturbation framework to measure the robustness of graph properties. Although there are already perturbation methods proposed to tackle this problem, they are limited by the fact that the strength of the perturbation cannot be well controlled. We firstly provide a perturbation framework on graphs by introducing weights on the nodes, of which the magnitude of perturbation can be easily controlled through the variance of the weights. Meanwhile, the topology of the graphs are also preserved to avoid uncontrollable strength in the perturbation. We then extend the measure of robustness in the robust statistics literature to the graph properties.

机译：本文主要研究社会网络中的两个问题。在论文的第一部分，我们着重研究社会网络中的社区恢复问题。在基于模型的网络数据社区恢复中，最近有许多理论上的进步。它的中心是随机块模型（SBM）及其扩展，即度校正随机块模型（DC-SBM）。在假设集群平衡和分离的前提下，提供了理论上的保证以确保真实集群的恢复概率很高。我们首先通过实验方法对DC-SBM上的当前恢复定理进行基准测试。实验表明，仍然有很多情况是可恢复的，但当前恢复定理无法预测这些情况。然后，我们介绍一类称为“偏好框架模型”的网络模型。我们表明，在较弱的假设下，可以通过光谱聚类算法以基本相同的保证来恢复群落或聚类。尽管基于模型的结果很重要，但受到强有力且难以验证的假设（即观察到的数据是从模型中生成）的限制。我们提出了无模型的社区恢复，在此我们不对数据生成过程做任何假设，并在此框架下为基于模型的聚类算法的性能提供了理论保证。;在论文的第二部分，我们提出了一种扰动衡量图属性鲁棒性的框架。尽管已经提出了解决该问题的摄动方法，但是它们受到不能很好地控制摄动强度的事实的限制。首先，通过在节点上引入权重，在图上提供一个扰动框架，通过权重的变化可以很容易地控制扰动的大小。同时，还保留了图的拓扑结构，以避免扰动中不可控制的强度。然后，我们将鲁棒统计资料中的鲁棒性度量扩展到图属性。

著录项

作者
Wan, Yali.;
展开▼
作者单位

University of Washington.;

展开▼
授予单位 University of Washington.;
学科 Statistics.;Mathematics.;Computer science.
学位 Ph.D.
年度 2017
页码 118 p.
总页数 118
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Paragraph Selection Methods Using Feature-Based on Segment-Based Clustering Process Using Paragraphs for Identifying Topics on Indication Detection of Plagiarism System [J] . Denar Regata Akbi, Arini Rahmawati Rosyadi Kinetik . 2018,第2期

机译：基于特征的段落选择方法基于段的聚类过程使用段落识别of窃系统指示检测的主题
2. Analyzing temporal patterns of topic diversity using graph clustering [J] . Hashimoto Takako, Shepard David Lawrence, Kuboyama Tetsuji, Journal of supercomputing . 2021,第5期

机译：使用图形聚类分析主题分集的时间模式
3. Graph-based topic models for trajectory clustering in crowd videos [J] . Manal Al Ghamdi, Yoshihiko Gotoh Machine Vision and Applications . 2020,第5期

机译：基于图形的轨迹群集在人群视频中的主题模型
4. Discovering hot topics using Twitter streaming data social topic detection and geographic clustering [C] . Kim Hwi-Gang, Lee Seongjoo, Kyeong Sunghyon IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining . 2013

机译：使用Twitter流数据社交话题检测和地理聚类发现热门话题
5. Topics in Clustering: Feature Selection and Semiparametric Modeling [D] . Pu, Xiao. 2017

机译：群集主题：特征选择和半甲酰型建模
6. Detecting sentiment dynamics and clusters of Twitter users for trending topics in COVID-19 pandemic [O] . Md Shoaib Ahmed, Tanjim Taharat Aurpa, Md Musfique Anwar, 2021

机译：检测Covid-19大流行中的Twitter用户的情感动力学和群集群
7. Graph-based topic models for trajectory clustering in crowd videos [O] . Manal Al Ghamdi, Yoshihiko Gotoh 2020

机译：基于图形的轨迹群集在人群视频中的主题模型
8. Dempster-Shafer Clustering, Potts Spins, and Graph Optimization (Dempster-Shafer Clustering, Potts Spins, and Graph Optimization). Scientific Report. [R] . Bengtsson, M., Schubert, J. 1999

机译：Dempster-shafer聚类，potts自旋和图优化（Dempster-shafer聚类，potts自旋和图优化）。科学报告。

Topics in Graph Clustering

摘要

著录项

相似文献

相关主题

期刊订阅