Deep neural networks for syllable based acoustic modeling in Chinese speech recognition

机译：语音识别中基于音节的声学模型的深度神经网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, the deep neural networks (DNNs) based acoustic modeling methods have been successfully applied to many speech recognition tasks. This paper reports the work about applying DNNs for syllable based acoustic modeling in Chinese automatic speech recognition (ASR). Compared with initial/finals (IFs), syllable can implicitly model the intra-syllable variations in better accuracy. However, the context dependent syllable based modeling set holds too many units, bringing about heavy problems on modeling and decoding implementation. In this paper, a WFST decoding framework is applied. Moreover, the decision tree based state tying and DNNs based models are discussed for the acoustic model training. The experimental results show that compared with the traditional IFs based modeling method, the proposed syllable modeling method using DNNs is more robust for data sparsity problem, which indicates that it has the potential to obtain better performance for Chinese ASR.

机译：最近，基于深度神经网络（DNN）的声学建模方法已成功应用于许多语音识别任务。本文报告了在中国自动语音识别（ASR）中应用基于音节的声学建模的DNN的工作。与初始/最终（IFS）相比，音节可以通过更好的准确度隐式模拟音节内变化。但是，基于上下文的基于音节的建模集保持了太多单位，在建模和解码实现上引发了沉重的问题。本文应用了WFST解码框架。此外，讨论了用于声学模型训练的基于决策树的状态和基于DNN的模型。实验结果表明，与传统的基于IFS的建模方法相比，使用DNN的建议的音节建模方法对于数据稀疏问题更加强大，这表明它有可能为中国ASR获得更好的性能。

著录项

来源
《Asia-Pacific Signal and Information Processing Association Annual Summit and Conference》|2013年|1-4|共4页
会议地点
作者
Li Xiangang; Hong Caifu; Yang Yuning; Wu Xihong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition [J] . Li Xiangang, Yang Yuning, Pang Zaihu, Neurocomputing . 2015,第deca25期

机译：基于大词汇量中文语音识别的深度神经网络中声学建模单元选择的比较研究
2. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
3. Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition [J] . Lee Moa, Lee Jeehye, Chang Joon-Hyuk Digital Signal Processing . 2019,第期

机译：混响语音识别的联合训练深神经网络声学模型的集合
4. Deep neural networks for syllable based acoustic modeling in Chinese speech recognition [C] . Li Xiangang, Hong Caifu, Yang Yuning, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2013

机译：基于音节的汉语语音识别声学建模的深神经网络
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O] . Arun Narayanan, DeLiang Wang -1

机译：通过语音分离和联合自适应训练提高深度神经网络声学模型的鲁棒性
7. Efficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network [O] . Haitao Yao, Maobo An, Ji Xu, 2016

机译：使用多任务深神经网络的无监督语音识别有效的声学建模方法

Deep neural networks for syllable based acoustic modeling in Chinese speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅