IEEE International Conference on Cloud Computing

The Design and Implementation of a Scalable Deep Learning Benchmarking Platform


Abstract

The current Deep Learning (DL) landscape is fast-paced and is rife with non-uniform models and hardware/software (HW/SW) stacks. Currently, there is no DL benchmarking platform to facilitate the evaluation and comparison of DL innovations, be they models, frameworks, libraries, or hardware. As a result, the current practice of evaluating the benefits of proposed DL innovations is both arduous and error-prone, stifling the adoption of these innovations. In this work, we first identify 10 design features that are desirable within a DL benchmarking platform. These features include performing evaluations in a consistent, reproducible, and scalable manner; being framework and hardware agnostic; supporting real-world benchmarking workloads; and providing in-depth model execution inspection across the HW/SW stack levels. We then propose MLModelScope, a DL benchmarking platform that realizes these 10 design objectives. MLModelScope introduces a specification to define DL model evaluations and provides a runtime to provision the evaluation workflow using the user-specified HW/SW stack. MLModelScope defines abstractions for frameworks and supports a broad range of DL models and evaluation scenarios. We implement MLModelScope as an open-source project with support for all major frameworks and hardware architectures. Through MLModelScope's evaluation and automated analysis workflows, we perform a case-study analysis of 37 models across 4 systems and show how model, hardware, and framework selection affects model accuracy and performance under different benchmarking scenarios. We further demonstrate how MLModelScope's tracing capability gives a holistic view of model execution and helps pinpoint bottlenecks.
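To make the idea of a declarative evaluation specification plus a provisioning runtime more concrete, the following is a minimal, hypothetical sketch in Python. It does not reproduce MLModelScope's actual specification format or API; the names `EvaluationSpec` and `run_evaluation`, the field choices, and the trace-level values are illustrative assumptions only.

```python
# Hypothetical sketch (not MLModelScope's actual API): a user-supplied
# specification describes one model evaluation, and a runtime provisions
# the requested HW/SW stack and runs the benchmark against it.
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class EvaluationSpec:
    """Declarative description of one DL model evaluation (illustrative)."""
    model_name: str                      # e.g. "ResNet50_v1.5"
    framework: str                       # e.g. "TensorFlow", "PyTorch"
    hardware: str                        # e.g. "gpu" or "cpu"
    batch_sizes: List[int] = field(default_factory=lambda: [1])
    dataset: str = "synthetic"           # real-world or synthetic inputs
    trace_level: str = "model"           # model / framework / library / hardware


def run_evaluation(spec: EvaluationSpec) -> Dict[str, float]:
    """Toy runtime: walk the evaluation workflow implied by `spec` and
    return aggregate metrics. Only the control flow is shown here; real
    numbers would come from the provisioned stack."""
    results: Dict[str, float] = {}
    for batch in spec.batch_sizes:
        # 1. provision the framework and hardware named in the spec
        # 2. load the model and the requested dataset
        # 3. run inference and collect traces at spec.trace_level
        results[f"latency_ms@bs{batch}"] = float("nan")  # placeholder value
    return results


if __name__ == "__main__":
    spec = EvaluationSpec(model_name="ResNet50_v1.5",
                          framework="TensorFlow",
                          hardware="gpu",
                          batch_sizes=[1, 32])
    print(run_evaluation(spec))
```

The point of the sketch is the separation the abstract describes: the specification is framework- and hardware-agnostic data, while the runtime is responsible for provisioning the user-specified HW/SW stack and producing comparable, reproducible measurements.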
