首页> 外文会议>International Conference on Communication Systems and Networks >Discovering signature of social networks with application to community detection
【24h】

Discovering signature of social networks with application to community detection

机译:发现社交网络的签名并将其应用于社区检测

获取原文

摘要

Today, any online social media collects a huge volume of data not just about who is linked with whom (aka link data) but also, about who is interacting with whom (aka interaction data). The presence of both variety and volume in these datasets pose new challenges, and thereby opportunities for the field of social network analysis (SNA). Traditionally, SNA techniques are designed to work only with the link data. Recently, there have been some attempts to analyze link data in conjunction with interaction data. In this paper, we advance this research agenda further by introducing a notion called signature of a social network and propose an efficient approach to compute it. The signature of a social network is essentially a sparse subgraph of the original social network such that it succinctly captures key information contained within the data sources (both linked and interaction data). The signature of a social network need not be unique. The value behind computing such a signature stems from the fact that once computed, any subsequent SNA (e.g. community detection, influence propagation, etc.) becomes much faster while not compromising much with quality. The concept of importance weights of the edges has been the guiding principle for us behind the idea of signature of a social network. In our approach, we start with deriving importance weights of the edges based on the information contained in these data sources. Next, we apply a novel graph sparsification technique to generate signature of the given social network by dropping edges that are not so informative. We demonstrate the efficacy of the signature of social network with an application to community detection on certain well-known social network datasets such as Digg, Youtube, Epinions, DBLP, and Amazon. We obtained effective community detection results on these datasets using our proposed approach while achieving about 40 times speed-up.
机译:如今,任何在线社交媒体都不仅收集有关谁与谁链接的数据(也称为链接数据),而且还收集有关谁与谁交互的数据(也称为交互数据)。这些数据集中多样性和体积的存在提出了新的挑战,从而为社交网络分析(SNA)领域带来了机遇。传统上,SNA技术被设计为仅与链接数据一起使用。近来,已经进行了一些尝试以结合交互数据来分析链接数据。在本文中,我们通过引入称为社交网络签名的概念来进一步推进该研究议程,并提出了一种有效的计算方法。社交网络的签名本质上是原始社交网络的稀疏子图,因此它可以简洁地捕获数据源(链接数据和交互数据)中包含的关键信息。社交网络的签名不必是唯一的。计算这样的签名的价值来自于这样一个事实,即一旦计算,任何后续的SNA(例如,社区检测,影响传播等)将变得更快,同时又不会对质量造成太大影响。边缘重要性权重的概念一直是我们背后的社交网络签名思想的指导原则。在我们的方法中,我们从基于这些数据源中包含的信息得出边缘的重要性权重开始。接下来,我们应用一种新颖的图稀疏化技术,通过删除不那么有用的边缘来生成给定社交网络的签名。我们通过在某些知名的社交网络数据集(例如Digg,Youtube,Epinions,DBLP和Amazon)上进行社区检测的应用程序,演示了社交网络签名的功效。我们使用我们提出的方法在这些数据集上获得了有效的社区检测结果,同时实现了约40倍的提速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号