MAIN VOWEL DOMAIN TONE MODELING WITH LEXICAL AND PROSODIC ANALYSIS FOR MANDARIN ASR

机译：普通话与普通话分析的主要元音域音色建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The tone is a distinctive discriminative feature in Mandarin Chinese. Often functional, yet seldom thorough are most large-scale Mandarin speech recognition systems in treating tone modeling. In particular, many lack the necessary sophistication to deal with the myriad variations arising from the combination of acoustic and lexical contexts. This paper reports an attempt to account for these variabilities and to bring richer tone modeling into the IBM Mandarin broadcast transcription system. In particular, we describe a system that combines the embedded approach and a novel explicit tone modeling technique characterized by a. robust tone tracking in the main-vowel domain, and b. context-dependent models with lexical and prosodic contexts. The proposed method is validated on a connected-digits set and subsequently evaluated on a large-vocabulary broadcast transcription task. It is shown that 14.8percent and 5.4percent relative reductions in character error rate are achieved respectively.

机译：语气是普通话中的一个独特的歧视特征。经常练功，很少彻底是大量大规模的普通话语音识别系统，治疗语气建模。特别是，许多人缺乏对来自声学和词汇表的组合产生的无数变化的必要复杂性。本文报告了试图考虑这些可变性，并将更丰富的口气建模纳入IBM普通话广播转录系统。特别地，我们描述了一种组合嵌入方法的系统和一种新颖的显式音调建模技术，其特征是a。主元音域中的强大音调跟踪，b。与词汇和韵律上下文的上下文相关模型。所提出的方法在连接的数字集上验证，随后在大词汇表广播转录任务上进行评估。结果表明，分别实现了14.8％和5.4分别以字符错误率的相对减少。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|2009年||共4页
会议地点
作者
Shilei Zhang; Qin Shi; Stephen M. Chu; Yong Qin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
tone models; decision tree; main vowel; tone domain; lattice rescoring;

机译：音色模型;决定树;主元音;音调;格子救援;

相似文献

外文文献
中文文献
专利

1. Effect of blindness on mismatch responses to Mandarin lexical tones, consonants, and vowels [J] . Feng Jie, Liu Chang, Li Mingshuang, Hearing Research: An International Journal . 2019,第期

机译：失明对普通话对普通话语调色调，辅音和元音的反应的影响
2. Mismatch responses to lexical tone, initial consonant, and vowel in Mandarin-speaking preschoolers [J] . Neuropsychologia . 2012,第14期

机译：普通话学龄前儿童对词汇语调，初始辅音和元音的不匹配反应
3. Hidden Markov modeling of frequency-following responses to Mandarin lexical tones [J] . Llanos Fernando, Xie Zilong, Chandrasekaran Bharath Journal of Neuroscience Methods . 2017,第期

机译：隐藏的Markov模型对跨越词汇音调的频率跟随响应
4. Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR [C] . Shilei Zhang, Qin Shi, Chu S.M., IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 . 2009

机译：普通话ASR的主元音域音调建模与词汇和韵律分析
5. THE PROSODIC DOMAIN OF TONE SANDHI IN CHINESE (PHRASAL PHONOLOGY, TONAL TYPOLOGY, MANDARIN, SYNTAX-PHONOLOGY INTERFACE). [D] . SHIH, CHI-LIN. 1986

机译：汉语中音调三音的韵律域（短语音系，音调，普通话，语法音系）。
6. Hidden Markov Modeling of Frequency-Following Responses to Mandarin Lexical Tones [O] . Fernando Llanos, Zilong Xie, Bharath Chandrasekaran -1

机译：对普通话音调的频率跟随响应的隐马尔可夫建模
7. Mandarin Lexical Tones: A Corpus-Based Study of Word Length, Syllable Position and Prosodic Position on Duration [O] . Yaru Wu, Martine Adda-Decker, Lori Lamel 2020

机译：普通话词汇音：基于语料库的词长，音节位置和持续时间韵律位置的研究

MAIN VOWEL DOMAIN TONE MODELING WITH LEXICAL AND PROSODIC ANALYSIS FOR MANDARIN ASR

摘要

著录项

相似文献

相关主题

期刊订阅