IEEE Transactions on Parallel and Distributed Systems

iCELIA: A Full-Stack Framework for STT-MRAM-Based Deep Learning Acceleration

Abstract

A large variety of applications rely on deep learning to process big data, learn sophisticated features, and perform complicated tasks. Utilizing emerging non-volatile memory's (NVM's) unique characteristics, including the crossbar array structure and gray-scale cell resistances, to perform neural network (NN) computation is a well-studied approach to accelerating deep learning applications. Compared to other NVM technologies, STT-MRAM has unique advantages in performing NN computation. However, state-of-the-art research has not utilized STT-MRAM for deep learning acceleration because of its device- and architecture-level challenges. Consequently, this paper enables STT-MRAM, for the first time, as an effective and practical deep learning accelerator. In particular, it proposes iCELIA, a full-stack framework spanning multiple design levels: device-level fabrication, circuit-level enhancements, architecture-level synaptic weight quantization, and system-level accelerator design. The primary contributions of iCELIA over our prior work CELIA are a new non-uniform weight quantization scheme and a much-enhanced accelerator system design. The proposed framework significantly mitigates the model accuracy loss caused by reduced data precision in a cohesive manner, constructing a comprehensive STT-MRAM accelerator system for fast NN computation with high energy efficiency and low cost.
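The abstract pairs non-uniform weight quantization with crossbar-based in-memory NN computation. The sketch below only illustrates those two general ideas and is not the paper's iCELIA scheme: it assumes a 1-D k-means-style quantizer and an idealized linear weight-to-conductance mapping, and all function names, conductance values, and parameters are made up for the example.

```python
import numpy as np


def nonuniform_quantize(weights, num_levels=8, iters=50):
    """Cluster weight values with 1-D k-means so quantization levels follow
    the weight distribution rather than being evenly spaced."""
    flat = weights.ravel()
    # Start the centroids at evenly spaced quantiles of the weight values.
    centroids = np.quantile(flat, np.linspace(0.0, 1.0, num_levels))
    for _ in range(iters):
        idx = np.argmin(np.abs(flat[:, None] - centroids[None, :]), axis=1)
        for k in range(num_levels):
            if np.any(idx == k):
                centroids[k] = flat[idx == k].mean()
    idx = np.argmin(np.abs(flat[:, None] - centroids[None, :]), axis=1)
    return centroids[idx].reshape(weights.shape), centroids


def crossbar_mvm(w_q, x, g_min=1e-6, g_max=1e-4):
    """Idealized crossbar matrix-vector product: each quantized weight maps
    linearly onto a cell conductance, input voltages drive the rows, and the
    column currents (Kirchhoff summation) give the dot products. Device noise
    and other non-idealities are ignored."""
    w_min, w_max = w_q.min(), w_q.max()
    scale = (g_max - g_min) / (w_max - w_min + 1e-12)
    g = g_min + scale * (w_q - w_min)   # conductance assigned to each cell
    currents = x @ g                    # analog accumulation along columns
    # Undo the linear weight-to-conductance mapping to recover x @ w_q.
    offset = g_min - scale * w_min
    return (currents - offset * x.sum(axis=-1, keepdims=True)) / scale


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(size=(64, 16))       # a small weight matrix
    x = rng.normal(size=(4, 64))        # a batch of input vectors
    w_q, levels = nonuniform_quantize(w, num_levels=8)
    print("quantization levels:", np.round(levels, 3))
    print("crossbar result matches x @ w_q:",
          np.allclose(crossbar_mvm(w_q, x), x @ w_q))
```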
