IEEE/ACM International Conference on Computer-Aided Design

Re-architecting the on-chip memory sub-system of machine-learning accelerator for embedded devices



Abstract

The rapid development of deep learning is enabling a wealth of novel applications, such as image and speech recognition, for embedded systems, robotics, and smart wearable devices. However, typical deep learning models such as deep convolutional neural networks (CNNs) consume so much on-chip storage and high-throughput compute capacity that they cannot easily be handled by mobile or embedded devices with tight silicon and power budgets. To enable large CNN models on mobile and other cutting-edge devices for IoT or cyber-physical applications, we propose an efficient on-chip memory architecture for CNN inference acceleration and show its application to our in-house general-purpose deep learning accelerator. The redesigned on-chip memory subsystem, Memsqueezer, includes an active weight-buffer set and a data-buffer set that employ specialized compression methods to reduce the footprint of CNN weights and data, respectively. The Memsqueezer buffers compress the data and weight sets according to their distinct characteristics, and they also include a built-in redundancy-detection mechanism that actively scans the working set of a CNN and boosts inference performance by eliminating data redundancy. Our experiments show that CNN accelerators with Memsqueezer buffers achieve more than a 2× performance improvement and reduce energy consumption by 80% on average over conventional buffer designs with the same area budget.
