Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications

Chinthaka Gamanayake; Lahiru Jayasinghe; Benny Kai Kiat Ng; Chau Yuen

首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications

【24h】

Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications

机译：群集修剪：EDGE AI视觉应用的高效滤波器修剪方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Even though the Convolutional Neural Networks (CNN) has shown superior results in the field of computer vision, it is still a challenging task to implement computer vision algorithms in real-time at the edge, especially using a low-cost IoT device due to high memory consumption and computation complexities in a CNN. Network compression methodologies such as weight pruning, filter pruning, and quantization are used to overcome the above mentioned problem. Even though filter pruning methodology has shown better performances compared to other techniques, irregularity of the number of filters pruned across different layers of a CNN might not comply with majority of the neural computing hardware architectures. In this paper, a novel greedy approach called cluster pruning has been proposed, which provides a structured way of removing filters in a CNN by considering the importance of filters and the underlying hardware architecture. The proposed methodology is compared with the conventional filter pruning algorithm on Pascal-VOC open dataset, and Head-Counting dataset, which is our own dataset developed to detect and count people entering a room. We benchmark our proposed method on three hardware architectures, namely CPU, GPU, and Intel Movidius Neural Computer Stick (NCS) using the popular SSD-MobileNet and SSD-SqueezeNet neural network architectures used for edge-AI vision applications. Results demonstrate that our method outperforms the conventional filter pruning methodology, using both datasets on above mentioned hardware architectures. Furthermore, a low cost IoT hardware setup consisting of an Intel Movidius-NCS is proposed to deploy an edge-AI application using our proposed pruning methodology.

机译：即使卷积神经网络（CNN）在计算机视野领域所示，它仍然是一个具有挑战性的任务，用于在边缘实时实现计算机视觉算法，特别是由于高成本的低成本IOT设备内存消耗和CNN中的计算复杂性。使用维度修剪，过滤器修剪和量化等网络压缩方法来克服上述问题。尽管与其他技术相比，过滤器修剪方法显示了更好的性能，但是在CNN的不同层上修剪的过滤器的数量的不规则可能不符合神经计算硬件架构的大多数。本文提出了一种称为集群修剪的新颖贪婪方法，它通过考虑滤波器和底层硬件架构的重要性提供了CNN中的滤波器的结构化方式。将所提出的方法与Pascal-Voc Open DataSet上的传统滤波器修剪算法进行比较，以及头部计数数据集，这是我们自己的数据集，用于检测和计算进入房间的人。我们在三个硬件架构，即CPU，GPU和Intel MovIdius神经计算机棒（NCS）上基准测试我们的三个硬件架构，即使用用于Edge-AI视觉应用程序的流行SSD-MobileNet和SSD-Screezenet神经网络架构。结果表明，我们的方法优于传统的滤波器修剪方法，使用上述硬件架构上的两个数据集。此外，建议使用Intel MovIdius-NC组成的低成本IOT硬件设置，以使用我们提出的修剪方法部署边缘AI应用程序。

著录项

来源
《Selected Topics in Signal Processing, IEEE Journal of》 |2020年第4期|802-816|共15页
作者
Chinthaka Gamanayake; Lahiru Jayasinghe; Benny Kai Kiat Ng; Chau Yuen;
展开▼
作者单位

Singapore University of Technology and Design Singapore Singapore;

Singapore University of Technology and Design Singapore Singapore;

Singapore University of Technology and Design Singapore Singapore;

Singapore University of Technology and Design Singapore Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Hardware; Convolution; Performance evaluation; Memory management; Neural networks; Real-time systems;

机译：硬件;卷积;绩效评估;内存管理;神经网络;实时系统;

相似文献

外文文献
中文文献
专利

1. EasiEdge: A Novel Global Deep Neural Networks Pruning Method for Efficient Edge Computing [J] . Fang Yu, Li Cui, Pengcheng Wang, Internet of Things Journal, IEEE . 2021,第3期

机译：EASEIDGE：一种新的全球深度神经网络修剪方法，用于高效的高效计算方法
2. A Spectral Clustering Based Filter-Level Pruning Method for Convolutional Neural Networks [J] . Lianqiang LI, Jie ZHU, Ming-Ting SUN IEICE transactions on information and systems . 2019,第12期

机译：卷积神经网络基于谱聚类的滤波级修剪方法
3. A new and efficient tree pruning method and its applications to workflow-based PPN nets for financial controls [J] . Jason C.H. Chen, Jack Penm, R. Deane Terrell International journal of information and decision sciences . 2011,第2期

机译：一种新的高效树修剪方法及其在基于工作流的PPN网中进行财务控制的应用
4. A Novel Clustering-Based Filter Pruning Method for Efficient Deep Neural Networks [C] . Xiaohui Wei, Xiaoxian Shen, Changbao Zhou, International Conference on Algorithms and Architectures for Parallel Processing . 2020

机译：一种用于高效深神经网络的基于组聚类的滤波器修剪方法
5. Spatio-temporal filtering for images and videos: Applications on quality enhancement, coding and data pruning [D] . Vo, Dung Trung 2009

机译：图像和视频的时空过滤：在质量增强，编码和数据修剪上的应用
6. A Soft-Pruning Method Applied During Training of Spiking Neural Networks for In-memory Computing Applications [O] . Yuhan Shi, Leon Nguyen, Sangheon Oh, 2010

机译：一种用于内存计算应用的尖峰神经网络训练中的软修剪方法
7. Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications [O] . Chinthaka Gamanayake, Lahiru Jayasinghe, Benny Kai Kiat Ng, 2020

机译：群集修剪：EDGE AI视觉应用的高效滤波器修剪方法

Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications

摘要

著录项

相似文献

相关主题

期刊订阅