IEEE Transactions on Computers

Device-Circuit-Architecture Co-Exploration for Computing-in-Memory Neural Accelerators



Abstract

Co-exploration of neural architectures and hardware design is promising because it can simultaneously optimize network accuracy and hardware efficiency. However, state-of-the-art neural architecture search (NAS) algorithms for such co-exploration are dedicated to the conventional von Neumann computing architecture, whose performance is heavily limited by the well-known memory wall. In this article, we are the first to bring the computing-in-memory architecture, which can easily transcend the memory wall, into interplay with neural architecture search, aiming to find the most efficient neural architectures with high network accuracy and maximized hardware efficiency. This novel combination creates opportunities to boost performance, but it also raises several challenges: the optimization space spans multiple design layers, from device type and circuit topology to neural architecture, and the presence of device variation may drastically degrade neural network performance. To address these challenges, we propose a cross-layer exploration framework, namely NACIM, which jointly explores the device, circuit, and architecture design space and takes device variation into consideration to find the most robust neural architectures, coupled with the most efficient hardware design. Experimental results demonstrate that NACIM can find a robust neural network with only a 0.45 percent accuracy loss in the presence of device variation, compared with a 76.44 percent loss for a state-of-the-art NAS that does not consider variation; in addition, NACIM achieves an energy efficiency of up to 16.3 TOPS/W, 3.17x higher than the state-of-the-art NAS.
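The cross-layer exploration described above can be illustrated, very loosely, as a search over a joint device/circuit/architecture space in which each candidate is evaluated under injected device variation. The sketch below is not the NACIM algorithm: the design-space entries, the synthetic accuracy/energy proxies, and the accuracy-per-energy scalarization are all illustrative assumptions standing in for real training, noise modeling, and hardware cost estimation.

```python
import random

# Hypothetical joint design space spanning the three layers the paper names:
# device type, circuit precision, and neural-architecture knobs.
DESIGN_SPACE = {
    "device":     ["ReRAM", "FeFET", "STT-MRAM"],
    "quant_bits": [2, 4, 6, 8],
    "num_layers": [4, 6, 8],
    "channels":   [16, 32, 64],
}

def sample_config(rng):
    """Draw one candidate from the cross-layer design space."""
    return {k: rng.choice(v) for k, v in DESIGN_SPACE.items()}

def evaluate(cfg, rng, variation_sigma=0.1, n_trials=8):
    """Toy stand-in for variation-aware evaluation. A real flow would
    train/test the network with device noise injected into the weights;
    here accuracy and energy are synthetic monotone proxies."""
    base_acc = 0.5 + 0.05 * cfg["num_layers"] + 0.001 * cfg["channels"]
    base_acc += 0.01 * cfg["quant_bits"]
    # Device variation degrades accuracy; average over noisy trials.
    accs = [base_acc - abs(rng.gauss(0, variation_sigma)) / cfg["quant_bits"]
            for _ in range(n_trials)]
    acc = min(1.0, sum(accs) / n_trials)
    # Crude energy proxy: bigger, higher-precision networks cost more.
    energy = cfg["num_layers"] * cfg["channels"] * cfg["quant_bits"]
    return acc, energy

def search(n_samples=100, seed=0):
    """Random search maximizing accuracy per unit energy, a simple
    scalarization of the accuracy/efficiency trade-off."""
    rng = random.Random(seed)
    best, best_score = None, float("-inf")
    for _ in range(n_samples):
        cfg = sample_config(rng)
        acc, energy = evaluate(cfg, rng)
        score = acc / energy
        if score > best_score:
            best, best_score = cfg, score
    return best

best = search()
print(best)
```

A production framework would replace random search with a learned controller or evolutionary strategy and evaluate candidates with actual noise-injected training, but the structure of the loop (sample a cross-layer config, score it under variation, keep the best trade-off) is the same.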
