首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Compressing Deep Neural Networks Using Toeplitz Matrix: Algorithm Design and Fpga Implementation
【24h】

Compressing Deep Neural Networks Using Toeplitz Matrix: Algorithm Design and Fpga Implementation

机译:使用Toeplitz矩阵压缩深层神经网络:算法设计和Fpga实现

获取原文

摘要

Deep neural networks (DNNs) have emerged as an important artificial intelligence technique. However, the computation-intensive and storage-intensive DNNs pose severe challenges on efficient execution over the underlying hardware platform. In this paper we propose to impose Toeplitz structure on DNN models to achieve high compression ratio with negligible performance loss. Accordingly, the hardware performance can be significantly improved after performing model compression. We evaluate the proposed approach on speech recognition and implement the corresponding compressed model on FPGA. Experimental results show that our approach enables high hardware performance while retaining high task performance.
机译:深度神经网络(DNN)被出现为重要的人工智能技术。然而,计算密集型和存储密集型DNN在底层硬件平台上有效执行严重挑战。在本文中,我们建议在DNN模型上施加Toeplitz结构,以实现高压缩比,具有可忽略的性能损失。因此,在执行模型压缩之后,可以显着改善硬件性能。我们评估语音识别的提出方法,并在FPGA上实现相应的压缩模型。实验结果表明,我们的方法能够在保持高任务性能的同时实现高硬件性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号