首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >Gossip-based partitioning and replication for Online Social Networks
【24h】

Gossip-based partitioning and replication for Online Social Networks

机译:在线社交网络基于八卦的分区和复制

获取原文

摘要

Online Social Networks (OSNs) have been gaining tremendous growth and popularity in the last decade, as they have been attracting billions of users from all over the world. Such networks generate petabytes of data from the social interactions among their users and create many management and scalability challenges. OSN users share common interests and exhibit strong community structures, which create complex dependability patterns within OSN data, thus, make it difficult to partition and distribute in a data center environment. Existing solutions, such as, distributed databases, key-value stores and auto scaling services use random partitioning to distribute the data across a cluster, which breaks existing dependencies of the OSN data and may generate huge inter-server traffic. Therefore, there is a need for intelligent data allocation strategy that can reduce the network cost for various OSN operations. In this paper, we present a gossip-based partitioning and replication scheme that efficiently splits OSN data and distributes the data across a cluster. We achieve fault tolerance and data locality, for one-hop neighbors, through replication. Our main contribution is a social graph placement strategy that divides the social graph into predefined size partitions and periodically updates the partitions to place socially connected users together. To evaluate our algorithm, we compare it with random partitioning and a state-of-the-art solution SPAR. Results show that our algorithm generates up to four times less replication overhead compared to random partitioning and half the replication overhead compared to SPAR.
机译:在过去的十年中,在线社交网络(OSN)吸引了来自世界各地的数十亿用户,因此获得了巨大的增长和普及。这样的网络从其用户之间的社交互动中生成了PB级的数据,并带来了许多管理和可扩展性挑战。 OSN用户具有共同的兴趣并表现出强大的社区结构,这在OSN数据内创建了复杂的可靠性模式,因此,很难在数据中心环境中进行分区和分发。现有的解决方案,例如分布式数据库,键值存储和自动缩放服务,都使用随机分区在整个群集中分布数据,这打破了OSN数据的现有依赖性,并可能产生巨大的服务器间流量。因此,需要可以减少各种OSN操作的网络成本的智能数据分配策略。在本文中,我们提出了一种基于八卦的分区和复制方案,该方案可以有效地分割OSN数据,并将数据分布在整个群集中。通过复制,我们为一跳邻居实现了容错能力和数据局部性。我们的主要贡献是一种社交图放置策略,该策略将社交图划分为预定义的大小分区,并定期更新分区以将社交连接的用户放置在一起。为了评估我们的算法,我们将其与随机分区和最新解决方案SPAR进行了比较。结果表明,与随机分区相比,我们的算法所产生的复制开销最多可减少四倍,而与SPAR相比,则可减少一半的复制开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号