【24h】

Community detection in content-sharing social networks

机译:内容共享社交网络中的社区检测

获取原文

摘要

Network structure and content in microblogging sites like Twitter influence each other — user A on Twitter follows user B for the tweets that B posts on the network, and A may then re-tweet the content shared by B to his/her own followers. In this paper, we propose a probabilistic model to jointly model link communities and content topics by leveraging both the social graph and the content shared by users. We model a community as a distribution over users, use it as a source for topics of interest, and jointly infer both communities and topics using Gibbs sampling. While modeling communities using the social graph, or modeling topics using content have received a great deal of attention, a few recent approaches try to model topics in content-sharing platforms using both content and social graph. Our work differs from the existing generative models in that we explicitly model the social graph of users along with the user-generated content, mimicking how the two entities co-evolve in content-sharing platforms. Recent studies have found Twitter to be more of a content-sharing network and less a social network, and it seems hard to detect tightly knit communities from the follower-followee links. Still, the question of whether we can extract Twitter communities using both links and content is open. In this paper, we answer this question in the affirmative. Our model discovers coherent communities and topics, as evinced by qualitative results on sub-graphs of Twitter users. Furthermore, we evaluate our model on the task of predicting follower-followee links. We show that joint modeling of links and content significantly improves link prediction performance on a sub-graph of Twitter (consisting of about 0.7 million users and over 27 million tweets), compared to generative models based on only structure or only content and paths-based methods such as Katz.
机译:在Twitter等微博站点中的网络结构和内容彼此影响 - 用户A ON Twitter遵循网络上的推文的推文,然后可以将B分享到他/她自己的追随者的内容重新推断。在本文中,我们提出了一个概率模型,通过利用社交图和用户共享的内容共同模拟链接社区和内容主题。我们将社区塑造为对用户的分发,用它作为感兴趣的主题的来源,并使用Gi​​bbs采样共同推断社区和主题。在使用社交图建模社区,或使用内容建模主题已经获得了大量的注意,最近的一些方法尝试使用内容和社交图来模拟内容共享平台中的主题。我们的工作与现有的生成模型不同,因为我们明确地模拟了用户的社交图以及用户生成的内容,模拟了两个实体如何在内容共享平台中共同发展。最近的研究已经发现Twitter更具内容共享网络和较少的社交网络,并且似乎很难从追随窗口链接中检测紧密编织的社区。尽管如此,我们是否可以使用链接和内容提取Twitter社区的问题。在本文中,我们以肯定的方式回答这个问题。我们的模型发现了连贯的社区和主题,因为在Twitter用户的子图中,定性结果被定性结果表达。此外,我们评估了我们的模型预测追随追随者的任务。我们表明,与基于结构或仅基于内容和路径的生成模型相比,链接和内容的联合建模显着提高了在Twitter(由约70万用户组成和超过2700万个推文)上的链路预测性能。 katz等方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号