Large Vocabulary Mandarin Speech Recognition With Different Approaches in Modeling Tones

机译：大型词汇普通话语音识别，采用造型色调的不同方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Large vocabulary continuous Mandarin speech recognition has been an important problem for speech recognition researchers for several reasons [1], [3]. First, of all, it is a tonal language that requires special treatment for the modeling of tones. There are five tones in mandarin which are necessary to disambiguate between confusable words. Secondly, the difficulty of entering Chinese by keyboard presents a great opportunity for speech recognition to improve computer usability. Previous approaches to modeling tones have included using a separate tone classifier [1] and incorporating pitch directly into the feature vector [3]. In this paper, we describe a large vocabulary Mandarin speech recognition system based on Microsoft's Whisper system. Several alternatives in modeling tones and their error rates on continuous speech are compared.

机译：大型词汇持续普通话语音识别是语音识别研究人员的重要问题，因为有几个原因[1]，[3]。首先，所有的色调语言需要特殊处理音调的造型。普通话中有五种色调，这是消除混淆词之间的消除歧义。其次，键盘进入中文的难度为语音识别提供了一个很好的机会，以提高计算机可用性。以前的建模音调方法已经包括使用单独的色调分类器[1]并将间距直接结合到特征向量中[3]。在本文中，我们描述了一种基于微软耳语系统的大型词汇普通话语音识别系统。比较了若干建模音调的替代品及其在连续语音上的错误率。

著录项

来源
《International conference on spoken language processing》|2000年||共4页
会议地点
作者
Eric Chang; Jinalai Zhou; Shuo Di; Chao Huang; Kai-Fu Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G18;
关键词

相似文献

外文文献
中文文献
专利

1. Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network [J] . Xiao-Dong WANG, Keikichi HIROSE, Jin-Song ZHANG, IEICE Transactions on Information and Systems . 2008,第6期

机译：基于音频核模型和神经网络的普通话连续语音识别
2. Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network [J] . Xiao-Dong Wang, Keikichi Hirose, Jin-Song Zhang, 電子情報通信学会技術研究報告. 音声. Speech . 2006,第443期

机译：基于音频核模型和神经网络的普通话连续语音识别
3. Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network [J] . Xiao-Dong Wang, Keikichi Hirose, Jin-Song Zhang, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2006,第441期

机译：基于音频核模型和神经网络的普通话连续语音识别
4. Large Vocabulary Mandarin Speech Recognition With Different Approaches in Modeling Tones [C] . Eric Chang, Jinalai Zhou, Shuo Di, 6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16-Oct.20 2000 Beijing International Convention Center, Beijing, China . 2000

机译：语音建模中不同方法的大词汇量普通话语音识别
5. Modeling lexical tones for Mandarin large vocabulary continuous speech recognition. [D] . Lei, Xin. 2006

机译：为普通话大词汇量连续语音识别建模词汇声调。
6. The Binaural Masking-Level Difference of Mandarin Tone Detection and the Binaural Intelligibility-Level Difference of Mandarin Tone Recognition in the Presence of Speech-Spectrum Noise [O] . Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang, -1

机译：语音频谱噪声下普通话检测的双耳掩蔽水平差异和普通话识别的双耳可懂度水平差异
7. The binaural masking-level difference of mandarin tone detection and the binaural intelligibility-level difference of mandarin tone recognition in the presence of speech-spectrum noise. [O] . Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang, 2015

机译：在存在语音频谱噪声的情况下，普通话音检测的双耳掩蔽级差异和普通话音识别的双耳可懂度级差异。

Large Vocabulary Mandarin Speech Recognition With Different Approaches in Modeling Tones

摘要

著录项

相似文献

相关主题

期刊订阅