首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning
【24h】

A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning

机译:高性能和可复制深度学习的模块化基准测试基础架构

获取原文

摘要

We introduce Deep500: the first customizable benchmarking infrastructure that enables fair comparison of the plethora of deep learning frameworks, algorithms, libraries, and techniques. The key idea behind Deep500 is its modular design, where deep learning is factorized into four distinct levels: operators, network processing, training, and distributed training. Our evaluation illustrates that Deep500 is customizable (enables combining and benchmarking different deep learning codes) and fair (uses carefully selected metrics). Moreover, Deep500 is fast (incurs negligible overheads), verifiable (offers infrastructure to analyze correctness), and reproducible. Finally, as the first distributed and reproducible benchmarking system for deep learning, Deep500 provides software infrastructure to utilize the most powerful supercomputers for extreme-scale workloads.
机译:我们介绍Deep500:第一个可自定义的基准测试基础结构,它可以公平地比较大量的深度学习框架,算法,库和技术。 Deep500背后的关键思想是其模块化设计,其中将深度学习分解为四个不同的层次:运营商,网络处理,培训和分布式培训。我们的评估表明,Deep500是可定制的(启用合并和基准化不同的深度学习代码的基准)并且公平(使用精心选择的指标)。而且,Deep500速度快(产生的开销可忽略不计),可验证的(提供基础结构以分析正确性)且可重现。最后,作为第一个用于深度学习的分布式可重现基准测试系统,Deep500提供了软件基础架构,可利用功能最强大的超级计算机处理极端规模的工作负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号