Code-switch Language Model with Inversion Constraints for Mixed Language Speech Recognition

机译：具有反转约束的混合语言语音识别码转换语言模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a first ever code-switch language model for mixed language speech recognition that incorporates syntactic constraints by a code-switch boundary prediction model, a code-switch translation model, and a reconstruction model. A WFST-based decoder then recognizes speech by combining an acoustic model, a pronunciation model and the code-switch language model in an integrated approach. Our proposed approach avoids making early decisions on code-switch boundaries and is therefore more robust than previous approaches. Our proposed system using the code-switch language model outperforms a baseline of interpolated language models by a statistically significant 0.91% on a mixed language lecture speech corpus, and 1.25% on a mixed language lunch conversation corpus. Our method also outperforms a language model that permits code-switch at all word boundaries by a statistically significant 1.35% on the lecture speech corpus and 1.69% on the lunch conversation corpus.

机译：我们提出了第一个用于混合语言语音识别的代码转换语言模型，该模型通过代码转换边界预测模型，代码转换翻译模型和重构模型结合了句法约束。然后，基于WFST的解码器通过以集成方法组合声学模型，发音模型和代码转换语言模型来识别语音。我们提出的方法避免了在代码切换边界上做出早期决策，因此比以前的方法更可靠。我们提出的使用代码切换语言模型的系统在混合语言讲演语料库上比插值语言模型的基准要好，具有统计学上的显着性0.91％，在混合语言午餐会话语料库上有1.25％的统计显着性。我们的方法还优于语言模型，后者允许在所有单词边界上进行代码转换，其演讲语音语料库的统计显着性为1.35％，午餐会话语料库的统计显着性为1.69％。

著录项

来源
《International conference on computational linguistics》|2012年|1671-1680|共10页
会议地点
作者
Ying Li; Pascale Fung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Code-switch; mixed language; language modeling;

机译：代码切换;混合语言语言建模;

相似文献

外文文献
中文文献
专利

1. Syllable language models for Mandarin speech recognition: Exploiting character language models [J] . Liu X., Hieronymus J.L., Gales M.J.F., The Journal of the Acoustical Society of America . 2013,第1期

机译：普通话语音识别的音节语言模型：利用字符语言模型
2. Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System [J] . S. SARASWATHI, T.V. GEETHA ACM transactions on Asian language information processing . 2007,第3期

机译：增强的基于词素的语言模型与不同的基于单词的语言模型的性能比较，以提高泰米尔语语音识别系统的性能
3. Modeling under-resourced languages for speech recognition [J] . Kurimo Mikko, Enarvi Seppo, Tilk Ottokar, Language Resources and Evaluation . 2017,第4期

机译：为语音识别建模资源不足的语言
4. Improved mixed language speech recognition using asymmetric acoustic model and language model with code-switch inversion constraints [C] . Li Ying, Fung Pascale IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：使用非对称声学模型和具有代码转换反转约束的语言模型改进混合语言语音识别
5. Lexical parsing strategies in two languages: Constraints on language selection in word recognition. [D] . Sumutka, Bianca M. 2003

机译：两种语言的词法解析策略：限制单词识别中的语言选择。
6. Age-Related Changes in Speech Recognition Performance in Spanish–English Bilinguals First and Second Languages [O] . Jamie L. Desjardins, Elisa G. Barraza, Jordan A. Orozco -1

机译：西班牙语-英语双语者的第一语言和第二语言的语音识别性能中与年龄相关的变化
7. An Evaluation Of Statistical Language Modeling For Speech Recognition Using A Mixed Category Of Both Words And Parts-Of-Speech [O] . Yumi Wakita, Jun Kawai, Hitoshi Iida 2007

机译：使用混合类别的单词和词性的语音识别的统计语言建模的评估
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R] . Ore, B. M. 2009

机译：语音识别，发音特征检测和多语言语音合成

Code-switch Language Model with Inversion Constraints for Mixed Language Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅