Conference: International Workshop on Machine Learning for Multimodal Interaction (MLMI 2008)

A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems



Abstract

In this paper we present a study of automatic speech recognition (ASR) systems that use context-dependent phonemes and graphemes as sub-word units, built on both the conventional HMM/GMM system and the tandem system. Experimental studies conducted on three different continuous speech recognition tasks show that systems using only context-dependent graphemes can yield competitive performance on small to medium vocabulary tasks when compared to a context-dependent phoneme-based ASR system. In particular, we demonstrate the utility of tandem features, which use an MLP trained to estimate phoneme posterior probabilities, in improving the performance of grapheme-based recognition systems: they implicitly incorporate phonemic knowledge into the system without requiring a phonetically transcribed lexicon.
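To make the idea of context-dependent graphemes concrete, the following is a minimal sketch of how a grapheme lexicon could be derived directly from spelling and then expanded into context-dependent units. It is an illustration only: the abstract does not describe the paper's unit inventory, boundary handling, or toolkit. The function names are hypothetical, and the `left-center+right` naming (common in HTK-style toolkits) and the word-internal expansion are assumptions.

```python
from typing import Dict, List


def grapheme_pronunciation(word: str) -> List[str]:
    """Spell a word as its letter (grapheme) sequence, skipping non-letters."""
    return [ch for ch in word.upper() if ch.isalpha()]


def to_context_dependent(units: List[str]) -> List[str]:
    """Expand a unit sequence into left/right context-dependent units.

    Assumption: word-internal expansion, so the first and last units of a
    word simply omit the missing left or right context.
    """
    expanded = []
    for i, center in enumerate(units):
        name = center
        if i > 0:
            name = f"{units[i - 1]}-{name}"      # attach left context
        if i < len(units) - 1:
            name = f"{name}+{units[i + 1]}"      # attach right context
        expanded.append(name)
    return expanded


def build_grapheme_lexicon(words: List[str]) -> Dict[str, List[str]]:
    """Map each word to context-dependent grapheme units; no phonetic
    transcription is needed, only the orthography."""
    return {w: to_context_dependent(grapheme_pronunciation(w)) for w in words}


lexicon = build_grapheme_lexicon(["cat", "speech"])
for word, units in lexicon.items():
    print(word, " ".join(units))
```

The same `to_context_dependent` expansion would apply unchanged to a phoneme sequence taken from a pronunciation dictionary. The difference the abstract highlights is that the grapheme route needs no such dictionary.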
