Gated Convolutional LSTM for Speech Commands Recognition

机译：门控卷积LSTM用于语音命令识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As the mobile device gaining increasing popularity, Acoustic Speech Recognition on it is becoming a leading application. Unfortunately, the limited battery and computational resources on a mobile device highly restrict the potential of Speech Recognition systems, most of which have to resort to a remote server for better performance. To improve the performance of local Speech Recognition, we propose C-1-G-2-Blstm. This model shares Convolutional Neural Network's ability of learning local feature and Recurrent Neural Network's ability of learning sequence data's long dependence. Furthermore, by adopting the Gated Convolutional Neural Network instead of a traditional CNN, we manage to greatly improve the model's capacity. Our tests demonstrate that C-1-G-2-Blstm can achieve a high accuracy at 90.6% on the Google Speech Commands data set, which is 6.4% higher than the state-of-art methods.

机译：随着移动设备变得越来越流行，其上的语音识别已经成为领先的应用程序。不幸的是，移动设备上有限的电池和计算资源极大地限制了语音识别系统的潜力，其中大多数语音识别系统必须求助于远程服务器才能获得更好的性能。为了提高本地语音识别的性能，我们提出了C-1-G-2-Blstm。该模型具有卷积神经网络学习局部特征的能力和递归神经网络学习序列数据的长期依赖性的能力。此外，通过采用门控卷积神经网络代替传统的CNN，我们设法大大提高了模型的容量。我们的测试表明，C-1-G-2-Blstm在Google Speech Commands数据集上可以达到90.6％的高精度，比最先进的方法高6.4％。

著录项

来源
《International conference on computational science》|2018年|669-681|共13页
会议地点
作者
Dong Wang; Shaohe Lv; Xiaodong Wang; Xinye Lin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Acoustic Speech Recognition; Localize Gated Convolutional Neural Network; Long Short Time Memory;

机译：语音识别定位门控卷积神经网络长短时记忆;

相似文献

外文文献
中文文献
专利

1. LSTM-convolutional-BLSTM encoder-decoder network for minimum mean-square error approach to speech enhancement [J] . Wang Zeyu, Zhang Tao, Shao Yangyang, Applied Acoustics . 2021,第Jana期

机译：LSTM-Convolutional-BLSTM编码器 - 解码器用于语音增强的最小均方误差方法
2. Human action recognition using convolutional LSTM and fully-connected LSTM with different attentions [J] . Zhang Zufan, Lv Zongming, Gan Chenquan, Neurocomputing . 2020,第Octa14期

机译：使用卷积LSTM和完全连接的LSTM具有不同关注的人类行动识别
3. Automatic proficiency assessment of Korean speech read aloud by non‐natives using bidirectional LSTM‐based speech recognition [J] . Yoo Rhee Oh, Kiyoung Park, Hyung‐Bae Jeon, ETRI journal . 2020,第5期

机译：使用基于双向LSTM的语音识别，非洲主义韩国语音的自动能力评估大声朗读
4. Gated Convolutional LSTM for Speech Commands Recognition [C] . Dong Wang, Shaohe Lv, Xiaodong Wang, International Conference on Computational Science . 2018

机译：Gated卷积LSTM用于语音命令识别
5. Convolutional Neural Networks for Speaker-Independent Speech Recognition. [D] . Belilovsky, Eugene. 2011

机译：用于与说话人无关的语音识别的卷积神经网络。
6. Deep Convolutional and LSTM Networks on Multi-Channel Time Series Data for Gait Phase Recognition [O] . David Kreuzer, Michael Munz 2021

机译：用于远程阶段识别的多通道时间序列数据的深度卷积和LSTM网络
7. Low-Activity Supervised Convolutional Spiking Neural Networks Applied to Speech Commands Recognition [O] . Thomas Pellegrini, Romain Zimmer, Timothee Masquelier 2021

机译：低活动监督卷积尖峰神经网络应用于语音命令识别
8. LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition. [R] . Irie, K., Tuske, Z., Alkhouli, T., 2016

机译：LsTm，GRU，公路和一点注意：语音识别中语言建模的经验概述。

Gated Convolutional LSTM for Speech Commands Recognition

摘要

著录项

相似文献

相关主题

期刊订阅