...
首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications
【24h】

Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications

机译:群集修剪:EDGE AI视觉应用的高效滤波器修剪方法

获取原文
获取原文并翻译 | 示例
           

摘要

Even though the Convolutional Neural Networks (CNN) has shown superior results in the field of computer vision, it is still a challenging task to implement computer vision algorithms in real-time at the edge, especially using a low-cost IoT device due to high memory consumption and computation complexities in a CNN. Network compression methodologies such as weight pruning, filter pruning, and quantization are used to overcome the above mentioned problem. Even though filter pruning methodology has shown better performances compared to other techniques, irregularity of the number of filters pruned across different layers of a CNN might not comply with majority of the neural computing hardware architectures. In this paper, a novel greedy approach called cluster pruning has been proposed, which provides a structured way of removing filters in a CNN by considering the importance of filters and the underlying hardware architecture. The proposed methodology is compared with the conventional filter pruning algorithm on Pascal-VOC open dataset, and Head-Counting dataset, which is our own dataset developed to detect and count people entering a room. We benchmark our proposed method on three hardware architectures, namely CPU, GPU, and Intel Movidius Neural Computer Stick (NCS) using the popular SSD-MobileNet and SSD-SqueezeNet neural network architectures used for edge-AI vision applications. Results demonstrate that our method outperforms the conventional filter pruning methodology, using both datasets on above mentioned hardware architectures. Furthermore, a low cost IoT hardware setup consisting of an Intel Movidius-NCS is proposed to deploy an edge-AI application using our proposed pruning methodology.
机译:即使卷积神经网络(CNN)在计算机视野领域所示,它仍然是一个具有挑战性的任务,用于在边缘实时实现计算机视觉算法,特别是由于高成本的低成本IOT设备内存消耗和CNN中的计算复杂性。使用维度修剪,过滤器修剪和量化等网络压缩方法来克服上述问题。尽管与其他技术相比,过滤器修剪方法显示了更好的性能,但是在CNN的不同层上修剪的过滤器的数量的不规则可能不符合神经计算硬件架构的大多数。本文提出了一种称为集群修剪的新颖贪婪方法,它通过考虑滤波器和底层硬件架构的重要性提供了CNN中的滤波器的结构化方式。将所提出的方法与Pascal-Voc Open DataSet上的传统滤波器修剪算法进行比较,以及头部计数数据集,这是我们自己的数据集,用于检测和计算进入房间的人。我们在三个硬件架构,即CPU,GPU和Intel MovIdius神经计算机棒(NCS)上基准测试我们的三个硬件架构,即使用用于Edge-AI视觉应用程序的流行SSD-MobileNet和SSD-Screezenet神经网络架构。结果表明,我们的方法优于传统的滤波器修剪方法,使用上述硬件架构上的两个数据集。此外,建议使用Intel MovIdius-NC组成的低成本IOT硬件设置,以使用我们提出的修剪方法部署边缘AI应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号