首页> 外文会议>IEEE International Conference on Network Protocols >Community classification on Decentralized Social Networks based on 2-hop neighbourhood information
【24h】

Community classification on Decentralized Social Networks based on 2-hop neighbourhood information

机译:基于两跳邻域信息的去中心化社交网络社区分类

获取原文

摘要

Decentralized Social Network (DSN) has attracted a lot of research and development interest in recent years. It is believed to be the solution to many problems of centralized services. Due to the data limitation imposed by common decentralized architectures, centralized algorithms that support social networking functions need to be re-designed. In this work, we tackle the problem of community detection for a given user under the constraint of limited local topology information. This naturally yields a classification formulation for community detection. As an initial study, we focus on a specific type of classifiers - classification by thresholding against a proximity measure between nodes. We investigated four proximity measures: Common Neighbours (CN), Adamic/Adar score (AA), Page Rank (PR), Personalized PageRank (PPR). Using data collected from a large-scale Social Networking Service (SNS) in practice, we show that PPR can outperform the others with a few pre-known labels (37.5% to 64.97% relative improvement in terms of Area Under the ROC Curve). We further carry out extensive numerical evaluation of PPR, showing that more pre-known labels can linearly increase the capability of the single-feature classifier based on PPR. Users can thus seek for a trade-off between labeling cost and classification accuracy.
机译:近年来,去中心化社交网络(DSN)吸引了许多研究和开发兴趣。据信,这是解决集中式服务的许多问题的解决方案。由于常见的分散式架构所施加的数据限制,需要重新设计支持社交网络功能的集中式算法。在这项工作中,我们解决了在有限的本地拓扑信息约束下针对给定用户的社区检测问题。这自然产生了用于社区检测的分类公式。作为初始研究,我们专注于特定类型的分类器-通过针对节点之间的邻近度度量进行阈值化进行分类。我们调查了四个接近度度量:常见邻居(CN),亚当/阿达(Adamic / Adar)得分(AA),页面排名(PR),个性化页面排名(PPR)。在实践中使用从大型社交网络服务(SNS)收集的数据,我们显示PPR可以通过一些知名标签(在ROC曲线下的面积方面相对改善37.5%至64.97%)优于其他PPR。我们进一步对PPR进行了广泛的数值评估,结果表明,更多的知名标签可以线性增加基于PPR的单特征分类器的功能。因此,用户可以在标签成本和分类准确性之间寻求折衷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号