IEEE International Symposium on High Performance Computer Architecture

HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array



Abstract

With the rise of artificial intelligence in recent years, Deep Neural Networks (DNNs) have been widely used in many domains. To achieve high performance and energy efficiency, hardware acceleration of DNNs (especially inference) has been intensively studied in both academia and industry. However, two challenges remain: large DNN models and datasets incur frequent off-chip memory accesses, and the training of DNNs is not well explored in recent accelerator designs. To truly provide high-throughput and energy-efficient acceleration for the training of deep and large models, we inevitably need multiple accelerators and must exploit the coarse-grain parallelism among them, beyond the fine-grain parallelism inside a layer considered in most existing architectures. This poses the key research question of how best to organize computation and dataflow among the accelerators.
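To make the coarse-grain question concrete, below is a minimal, self-contained Python sketch of one way to frame it: for each layer, choose between data parallelism (partitioning the mini-batch) and model parallelism (partitioning the weights) across an accelerator array so that a toy communication-cost model is minimized. The layer shapes, cost formulas, and exhaustive search here are illustrative assumptions for this sketch only; they are not HyPar's actual partition algorithm or cost model.

```python
# Illustrative sketch only: per-layer choice between data parallelism and
# model parallelism across an accelerator array, minimizing a toy
# communication cost. Shapes and cost formulas are hypothetical.
from itertools import product

# Hypothetical fully connected layers: (fan_in, fan_out)
layers = [(1024, 4096), (4096, 4096), (4096, 1000)]
batch = 256   # mini-batch size
P = 4         # accelerators in the array

def intra_layer_cost(shape, scheme):
    """Toy communication volume inside one layer for a given scheme."""
    fan_in, fan_out = shape
    if scheme == "data":
        # Data parallelism: weight gradients are all-reduced across P accelerators.
        return 2 * (P - 1) / P * fan_in * fan_out
    # Model parallelism: partial activations/errors are exchanged instead.
    return (P - 1) / P * batch * (fan_in + fan_out)

def inter_layer_cost(prev_scheme, cur_scheme, shape):
    """Toy cost of re-distributing activations when adjacent layers differ."""
    fan_in, _ = shape
    return 0 if prev_scheme == cur_scheme else batch * fan_in

def total_cost(assignment):
    """Sum intra-layer and inter-layer costs for one per-layer assignment."""
    cost = 0
    for i, (shape, scheme) in enumerate(zip(layers, assignment)):
        cost += intra_layer_cost(shape, scheme)
        if i > 0:
            cost += inter_layer_cost(assignment[i - 1], scheme, shape)
    return cost

# Exhaustive search over the 2^L per-layer choices; a real system needs a
# more scalable search, which this sketch does not attempt to reproduce.
best = min(product(["data", "model"], repeat=len(layers)), key=total_cost)
print("best per-layer parallelism:", best, "cost:", total_cost(best))
```

Even this toy model shows why the choice is a "hybrid" one: the cheaper scheme depends on each layer's weight and activation sizes, and switching schemes between adjacent layers adds its own re-distribution cost.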
