首页> 外文会议>IEEE High Performance Extreme Computing Conference >A distributed algorithm for the efficient computation of the unified model of social influence on massive datasets
【24h】

A distributed algorithm for the efficient computation of the unified model of social influence on massive datasets

机译:一种有效计算海量数据社会影响统一模型的分布式算法

获取原文

摘要

Online social networks offer a rich data source for analyzing diffusion processes including rumor and viral spreading in communities. While many models exist, a unified model which enables analytical computation of complex, nonlinear phenomena while considering multiple factors was only recently proposed. We design an optimized implementation of the unified model of influence for vertex centric graph processing distributed platforms such as Apache Giraph. We validate and test the weak and strong scalability of our implementation on a Google Cloud Platform Hadoop and a Giraph installation using two real datasets. Results show a ~3.2× performance improvement over the single node runtime on the same platform. We also analyze the cost of achieving this speedup on public clouds as well as the impact of the underlying platform and the requirement of having more distributed nodes to process the same dataset as compared to a shared memory system.
机译:在线社交网络为分析传播过程(包括谣言和病毒在社区中的传播)提供了丰富的数据源。尽管存在许多模型,但最近才提出了一个统一模型,该模型能够在考虑多个因素的情况下对复杂的非线性现象进行分析计算。我们为诸如Apache Giraph之类的以顶点为中心的图形处理分布式平台设计了统一影响模型的优化实现。我们使用两个真实的数据集验证并测试了我们在Google Cloud Platform Hadoop和Giraph安装上实施的弱扩展性和强扩展性。结果表明,在同一平台上,单节点运行时的性能提高了约3.2倍。我们还分析了在公共云上实现这种加速的成本,以及底层平台的影响,以及与共享内存系统相比,需要更多分布式节点来处理同一数据集的需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号