首页> 外文会议>2012 8th International Symposium on Chinese Spoken Language Processing. >A study on cross-language knowledge integration in Mandarin LVCSR
【24h】

A study on cross-language knowledge integration in Mandarin LVCSR

机译:普通话LVCSR中跨语言知识整合研究。

获取原文
获取原文并翻译 | 示例

摘要

We present a cross-language knowledge integration framework to improve the performance in large vocabulary continuous speech recognition. Two types of knowledge sources, manner attribute and prosodic structure, are incorporated. For manner of articulation, cross-lingual attribute detectors trained with an American English corpus (WSJ0) are utilized to verify and rescore hypothesized Mandarin syllables in word lattices obtained with state-of-the-art systems. For the prosodic structure, models trained with an unsupervised joint prosody labeling and modeling technique using a Mandarin corpus (TCC300) are used in lattice rescoring. Experimental results on Mandarin syllable, character and word recognition with the TCC300 corpus show that the proposed approach significantly outperforms the baseline system that does not use articulatory and prosodic information. It also demonstrates a potential of utilizing results from cross-lingual attribute detectors as a language-universal frontend for automatic speech recognition.
机译:我们提出了一种跨语言的知识集成框架,以提高大词汇量连续语音识别的性能。合并了两种类型的知识源,即方式属性和韵律结构。对于表达方式,使用经过美国英语语料库(WSJ0)训练的跨语言属性检测器来验证和重新计算由最新技术系统获得的单词格中的假设普通话音节。对于韵律结构,使用普通话语料库(TCC300)的无监督关节韵律标记和建模技术训练的模型用于格点计分。 TCC300语料库对普通话音节,字符和单词识别的实验结果表明,所提出的方法明显优于不使用发音和韵律信息的基线系统。它还展示了利用跨语言属性检测器的结果作为自动语音识别的通用语言前端的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号