International Symposium on Advanced Parallel Processing Technologies

SPART: Optimizing CNNs by Utilizing Both Sparsity of Weights and Feature Maps



Abstract

Intense convolution computation and large memory requirements constrain the wider deployment and application of CNNs. Although both the weights and the feature maps in CNNs can be sparse, directly mapping sparse convolution to spGEMM from the HPC domain fails to improve actual performance. Moreover, existing sparse formats such as CSR are not suitable for encoding sparse feature maps, because convolution operates across rows. In this work, we propose a new format and a novel sparse convolution algorithm to optimize sparse CNNs on GPUs. First, we design the Compressed Feature Map (CFM) format to store sparse feature maps. Second, we propose an efficient sparse convolution algorithm, called SPART, that exploits both sparse weights and sparse feature maps. Finally, we optimize this algorithm on GPUs. Our experiments show that SPART performs well: compared with dense convolution, its speedup is up to 2.62× (1.77× on average) on V100 and up to 1.84× (1.24× on average) on Titan X.
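The abstract does not detail the CFM encoding or the SPART GPU kernel, but the core idea it names, namely skipping every multiplication in which either the weight or the activation is zero, can be sketched in plain Python. Everything below (the `sparse_conv2d` name, the nonzero-coordinate list used as a stand-in for CFM, the scatter-style traversal) is an illustrative assumption, not the paper's implementation.

```python
def sparse_conv2d(fmap, kernel):
    """Direct 2D convolution (valid padding, stride 1) that only visits
    nonzero weights and nonzero activations.

    `fmap` and `kernel` are dense lists-of-lists here; a real sparse
    format (such as the paper's CFM) would store the nonzero
    coordinates directly instead of scanning for them.
    """
    H, W = len(fmap), len(fmap[0])
    K = len(kernel)
    OH, OW = H - K + 1, W - K + 1
    out = [[0.0] * OW for _ in range(OH)]
    # Precompute the nonzero weights once: (row, col, value) triples.
    nz_w = [(ki, kj, kernel[ki][kj])
            for ki in range(K) for kj in range(K)
            if kernel[ki][kj] != 0]
    # Scatter each nonzero activation into every output it touches;
    # zero activations and zero weights contribute no work at all.
    for i in range(H):
        for j in range(W):
            a = fmap[i][j]
            if a == 0:
                continue
            for ki, kj, w in nz_w:
                oi, oj = i - ki, j - kj
                if 0 <= oi < OH and 0 <= oj < OW:
                    out[oi][oj] += a * w
    return out
```

Note the scatter (output-accumulating) order: each nonzero activation is read once and updates all outputs it overlaps, so the inner-loop trip count is proportional to the product of the two nonzero counts rather than to the dense H×W×K×K work.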

