基于循环神经网络的藏语语音识别声学模型

黄晓辉; 李京

首页> 中文期刊>中文信息学报 >基于循环神经网络的藏语语音识别声学模型

基于循环神经网络的藏语语音识别声学模型

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The recurrent neural network and the connectionist temporal classification algorithm are applied to the a-coustic modeling of Tibetan speech recognition ,so as to achieve end-to-end model training .According to the rela-tionship between the input and output of the acoustic model ,the time domain convolution operation on the output se-quence of the hidden layer is introduced to reduce the time domain expansion of the network's hidden layers .Experi-mental results show that the recurrent neural network model achieves better recognition performance in Tibetan Lha-sa phoneme recognition compared with the traditional acoustic models based on Hidden Markov Model ,while the a-coustic model based on recurrent neural network with time-domain convolution possesses higher training and deco-ding efficiency while maintaining the same recognition performance .%探索将循环神经网络和连接时序分类算法应用于藏语语音识别声学建模,实现端到端的模型训练.同时根据声学模型输入与输出的关系,通过在隐含层输出序列上引入时域卷积操作来对网络隐含层时域展开步数进行约简,从而有效提升模型的训练与解码效率.实验结果显示,与传统基于隐马尔可夫模型的声学建模方法相比,循环神经网络模型在藏语拉萨话音素识别任务上具有更好的识别性能,而引入时域卷积操作的循环神经网络声学模型在保持同等识别性能的情况下,拥有更高的训练和解码效率.

著录项

来源
《中文信息学报》|2018年第5期|49-55|共7页
作者
黄晓辉; 李京;
展开▼
作者单位

中国科学技术大学计算机科学与技术学院 ,安徽合肥230027;

解放军外国语学院河南洛阳471003;

中国科学技术大学计算机科学与技术学院 ,安徽合肥230027;

展开▼
原文格式 PDF
正文语种 chi
中图分类信息处理（信息加工）;
关键词
循环神经网络; 藏语语音识别; 声学建模; 时域卷积;

相似文献

中文文献
外文文献
专利

1. 藏语拉萨话大词表连续语音识别声学模型研究 [J] . 李冠宇 ,孟猛 . 计算机工程 . 2012,第005期
2. 基于改进门控单元神经网络的语音识别声学模型研究 [J] . 俞建强 ,颜雁 ,刘葳 . 长春理工大学学报（自然科学版） . 2020,第001期
3. 基于声学模型的不良语音识别技术研究 [J] . 杜刚 ,朱艳云 ,张晨 . 电信工程技术与标准化 . 2019,第012期
4. 基于自适应心理声学模型的智能语音识别系统 [J] . 熊笑颜 ,陈栩 ,黄灿英 . 沈阳工业大学学报 . 2017,第006期
5. 基于HMM模型语音识别系统中声学模型的建立 [J] . 胡石 ,章毅 ,陈芳 . 通讯世界 . 2017,第008期
6. 基于DNN与RNN声学模型融和的语音识别研究 [C] . Huifeng Zhu ,朱会峰 ,Yong He . 第十三届全国人机语音通讯学术会议 . 2015
7. 基于双向循环神经网络的藏语语音识别研究 [A] . 刘秀秀 . 2020

基于循环神经网络的藏语语音识别声学模型

摘要

著录项

相似文献

相关主题

期刊订阅