首页> 外文会议>International conference on computational science >Gated Convolutional LSTM for Speech Commands Recognition
【24h】

Gated Convolutional LSTM for Speech Commands Recognition

机译:门控卷积LSTM用于语音命令识别

获取原文

摘要

As the mobile device gaining increasing popularity, Acoustic Speech Recognition on it is becoming a leading application. Unfortunately, the limited battery and computational resources on a mobile device highly restrict the potential of Speech Recognition systems, most of which have to resort to a remote server for better performance. To improve the performance of local Speech Recognition, we propose C-1-G-2-Blstm. This model shares Convolutional Neural Network's ability of learning local feature and Recurrent Neural Network's ability of learning sequence data's long dependence. Furthermore, by adopting the Gated Convolutional Neural Network instead of a traditional CNN, we manage to greatly improve the model's capacity. Our tests demonstrate that C-1-G-2-Blstm can achieve a high accuracy at 90.6% on the Google Speech Commands data set, which is 6.4% higher than the state-of-art methods.
机译:随着移动设备变得越来越流行,其上的语音识别已经成为领先的应用程序。不幸的是,移动设备上有限的电池和计算资源极大地限制了语音识别系统的潜力,其中大多数语音识别系统必须求助于远程服务器才能获得更好的性能。为了提高本地语音识别的性能,我们提出了C-1-G-2-Blstm。该模型具有卷积神经网络学习局部特征的能力和递归神经网络学习序列数据的长期依赖性的能力。此外,通过采用门控卷积神经网络代替传统的CNN,我们设法大大提高了模型的容量。我们的测试表明,C-1-G-2-Blstm在Google Speech Commands数据集上可以达到90.6%的高精度,比最先进的方法高6.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号