首页> 外文会议>International conference on high performance computing >Quantifying Communication in Graph Analytics
【24h】

Quantifying Communication in Graph Analytics

机译:在图分析中量化交流

获取原文

摘要

Data analytics require complex processing, often taking the shape of parallel graph-based workloads. In ensuring a high level of efficiency for these applications, understanding where the bottlenecks lie is key, particularly understanding to which extent their performance is computation or communication-bound. In this work, we analyze a reference workload in graph-based analytics, the Graph 500 benchmark. We conduct a wide array of tests on a high-performance computing system, the MareNostrum Ⅲ supercomputer, using a custom high-precision profiling methodology. We show that the application performance is communication-bound, with up to 80 % of the execution time being spent enabling communication. We equally show that, with the increase in scale and concurrency that is expected in future big data systems and applications, the importance of communication increases. Finally, we characterize this representative data-analytics workload and show that the dominating data exchange is uniform all-to-all communication, opening avenues for workload and network optimization.
机译:数据分析需要复杂的处理,通常采用基于并行图的工作负载的形式。为了确保这些应用程序具有较高的效率,关键是要了解瓶颈所在,特别是要了解瓶颈在何种程度上受到计算或通信的限制。在这项工作中,我们在基于图的分析(图500基准)中分析了参考工作负载。我们使用定制的高精度配置方法在高性能计算系统MareNostrumⅢ超级计算机上进行了广泛的测试。我们证明了应用程序的性能受通信的限制,最多80%的执行时间都花在了通信上。我们同样表明,随着未来大数据系统和应用程序规模和并发性的提高,通信的重要性也在增加。最后,我们对这种代表性的数据分析工作负载进行了表征,并表明主导的数据交换是统一的所有所有通信,为工作负载和网络优化开辟了道路。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号