首页> 外文会议>Symposium on VLSI Circuits >SNAP: A 1.67 — 21.55TOPS/W Sparse Neural Acceleration Processor for Unstructured Sparse Deep Neural Network Inference in 16nm CMOS

【24h】

SNAP: A 1.67 — 21.55TOPS/W Sparse Neural Acceleration Processor for Unstructured Sparse Deep Neural Network Inference in 16nm CMOS

机译：SNAP：用于1.6nm CMOS中非结构化稀疏深度神经网络推理的1.67 — 21.55TOPS / W稀疏神经加速处理器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A Sparse Neural Acceleration Processor (SNAP) is designed to exploit unstructured sparsity in deep neural networks (DNNs). SNAP uses parallel associative search to discover input pairs to maintain an average 75% hardware utilization. SNAP's two-level partial sum reduce eliminates access contention and cuts the writeback traffic by 22×. Through diagonal and row configurations of PE arrays, SNAP supports any CONV and FC layers. A 2.4mm² 16nm SNAP test chip is measured to achieve a peak effectual efficiency of 21.55TOPS/W (16b) at 0.55V and 260MHz for CONV layers with 10% weight and activation density. Operating on pruned ResNet-50, SNAP achieves 90.98fps at 0.80V and 480MHz, dissipating 348mW.

机译：稀疏神经加速处理器（SNAP）旨在利用深度神经网络（DNN）中的非结构稀疏性。 SNAP使用并行关联搜索来发现输入对，以维持平均75％的硬件利用率。 SNAP的两级部分和减少减少了访问争用，并将回写流量减少了22倍。通过PE阵列的对角线和行配置，SNAP支持任何CONV和FC层。一个2.4毫米 ^{2
对于具有10％重量和激活密度的CONV层，对16nm SNAP测试芯片进行了测量，以在0.55V和260MHz时达到21.55TOPS / W（16b）的峰值有效效率。在修剪的ResNet-50上运行时，SNAP在0.80V和480MHz时可达到90.98fps，耗散348mW。}

著录项

来源
《Symposium on VLSI Circuits》|2019年|306-307|共2页
会议地点
作者
Jie-Fang Zhang; Ching-En Lee; Chester Liu; Yakun Sophia Shao; Stephen W. Keckler; Zhengya Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Very large scale integration;

机译：大规模整合;

相似文献

外文文献
中文文献
专利

1. SNAP: An Efficient Sparse Neural Acceleration Processor for Unstructured Sparse Deep Neural Network Inference [J] . Zhang Jie-Fang, Lee Ching-En, Liu Chester, IEEE Journal of Solid-State Circuits . 2021,第2期

机译：SNAP：一个有效的稀疏神经加速处理器，用于非结构化稀疏深神经网络推理
2. An Energy-Efficient Deep Convolutional Neural Network Inference Processor With Enhanced Output Stationary Dataflow in 65-nm CMOS [J] . IEEE transactions on very large scale integration (VLSI) systems . 2020,第1期

机译：节能型深度卷积神经网络推理处理器，具有增强的65nm CMOS输出固定数据流
3. Deep Neural Network Acceleration With Sparse Prediction Layers [J] . Yao Zhongtian, Huang Kejie, Shen Haibin, Quality Control, Transactions . 2020,第期

机译：具有稀疏预测层的深神经网络加速度
4. SNAP: A 1.67 — 21.55TOPS/W Sparse Neural Acceleration Processor for Unstructured Sparse Deep Neural Network Inference in 16nm CMOS [C] . Jie-Fang Zhang, Ching-En Lee, Chester Liu, Symposium on VLSI Circuits . 2019

机译：SNAP：1.67 - 21.55TOP /带稀疏神经加速处理器，用于非结构化稀疏深度神经网络推理16 nm CMOS
5. Joint Optimization of Quantization and Structured Sparsity for Compressed Deep Neural Networks [D] . Srivastava, Gaurav. 2018

机译：压缩深神经网络的量化和结构稀疏性的联合优化
6. Sparse deep neural networks on imaging genetics for schizophrenia case–control classification [O] . Jiayu Chen, Xiang Li, Vince D. Calhoun, 2021

机译：精神分裂症案例控制分类的成像遗传学稀疏深神经网络
7. Accelerating Training of Deep Neural Networks via Sparse Edge Processing [O] . Dey, Sourya, Shao, Yinan, Chugg, Keith M., 2017

机译：通过稀疏边缘处理加速深度神经网络的训练

SNAP: A 1.67 — 21.55TOPS/W Sparse Neural Acceleration Processor for Unstructured Sparse Deep Neural Network Inference in 16nm CMOS

摘要

著录项

相似文献

相关主题

期刊订阅