首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >X1000 real-time phoneme recognition VLSI using feed-forward deep neural networks
【24h】

X1000 real-time phoneme recognition VLSI using feed-forward deep neural networks

机译:使用前馈深度神经网络的X1000实时音素识别VLSI

获取原文

摘要

Deep neural networks show very good performance in phoneme and speech recognition applications when compared to previously used GMM (Gaussian Mixture Model)-based ones. However, efficient implementation of deep neural networks is difficult because the network size needs to be very large when high recognition accuracy is demanded. In this work, we develop a digital VLSI for phoneme recognition using deep neural networks and assess the design in terms of throughput, chip size, and power consumption. The developed VLSI employs a fixed-point optimization method that only uses +Δ, 0, and −Δ for representing each of the weight. The design employs 1,024 simple processing units in each layer, which however can be scaled easily according to the needed throughput, and the throughput of the architecture varies from 62.5 to 1,000 times of the real-time processing speed.
机译:与以前使用的基于GMM(高斯混合模型)的神经网络相比,深度神经网络在音素和语音识别应用中显示出非常好的性能。但是,由于需要高识别精度时网络规模非常大,因此深度神经网络的有效实现非常困难。在这项工作中,我们开发了用于使用深度神经网络进行音素识别的数字VLSI,并在吞吐量,芯片尺寸和功耗方面评估了设计。所开发的VLSI采用仅使用+Δ,0和-Δ表示每个权重的定点优化方法。该设计在每层中使用1,024个简单处理单元,但是可以根据所需的吞吐量轻松进行缩放,并且体系结构的吞吐量从实时处理速度的62.5倍到1,000倍不等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号