International Symposium on Advanced Parallel Processing Technologies

SPART: Optimizing CNNs by Utilizing Both Sparsity of Weights and Feature Maps



Abstract

The intense convolution computation and large memory requirements of CNNs constrain their wider deployment and application. Although both the weights and the feature maps in CNNs can be sparse, directly mapping sparse convolution to spGEMM from the HPC domain fails to improve actual performance. Moreover, existing sparse formats such as CSR are not suitable for encoding sparse feature maps, because convolution operates across rows. In this work, we propose a new format and a novel sparse convolution algorithm to optimize sparse CNNs on GPUs. First, we design the Compressed Feature Map (CFM) format to store sparse feature maps. Second, we propose an efficient sparse convolution algorithm called SPART that exploits both sparse weights and sparse feature maps. Finally, we optimize this algorithm on GPUs. Our experiments show that the SPART algorithm performs well: compared with dense convolution, its speedup reaches up to 2.62× (1.77× on average) on V100 and up to 1.84× (1.24× on average) on Titan X.
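The abstract does not spell out the CFM layout or the SPART kernel itself, but the core idea it describes, skipping work for zero weights *and* zero activations, can be illustrated with a minimal NumPy sketch. The function names and the scatter-style loop below are illustrative only, not the paper's actual algorithm: the sparse variant enumerates nonzero feature-map entries and nonzero weights, and scatters each product to the output it contributes to, so arithmetic scales with the product of the two nonzero counts rather than with the dense tensor sizes.

```python
import numpy as np

def dense_conv2d(fmap, kernel):
    """Reference dense 2D convolution (valid padding, stride 1)."""
    H, W = fmap.shape
    K, _ = kernel.shape
    out = np.zeros((H - K + 1, W - K + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(fmap[i:i + K, j:j + K] * kernel)
    return out

def sparse_conv2d(fmap, kernel):
    """Convolution touching only nonzero activations and nonzero weights.

    Each nonzero input pixel (fi, fj) times nonzero weight (ki, kj)
    contributes to output position (fi - ki, fj - kj), so all-zero
    products are never computed (scatter formulation).
    """
    H, W = fmap.shape
    K, _ = kernel.shape
    out = np.zeros((H - K + 1, W - K + 1))
    nz_weights = list(zip(*np.nonzero(kernel)))   # nonzero weight coords
    nz_inputs = list(zip(*np.nonzero(fmap)))      # nonzero activation coords
    for fi, fj in nz_inputs:
        v = fmap[fi, fj]
        for ki, kj in nz_weights:
            oi, oj = fi - ki, fj - kj
            if 0 <= oi < out.shape[0] and 0 <= oj < out.shape[1]:
                out[oi, oj] += v * kernel[ki, kj]
    return out
```

On a GPU, the interesting part is organizing these nonzeros so that threads get coalesced, balanced work, which is what a purpose-built format like CFM targets; the CSR objection in the abstract is that row-wise storage fights the across-rows access pattern of convolution windows.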
