首页> 外文会议>International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) >Effective Acoustic Modeling for Rate-of-Speech Variation in Large Vocabulary Conversational Speech Recognition
【24h】

Effective Acoustic Modeling for Rate-of-Speech Variation in Large Vocabulary Conversational Speech Recognition

机译:大型词汇会话语音识别中语速变化的有效声学建模

获取原文
获取原文并翻译 | 示例

摘要

We investigate several variants of speech-rate-dependent acoustic models for large-vocabulary conversational speech recognition, in the framework of combining rate-specific models in decoding to compensate for speech rate variation. We study two basic approaches to combining rate-specific models: one combines models at the pronunciation level and the other at the HMM state level. Furthermore, we investigate the influence of different numbers of rate-of-speech classes and different parameter tying schemes. Experiments on the Switchboard database, using SRI's DECIPHER recognition system, show that rate-dependent acoustic modeling resulted in a 2% relative word error rate reduction over a rate-independent baseline, and that the pronunciation-level constraint, Gaussian sharing between rate-specific models, and a well-chosen number of rate-of-speech classes are all important for best performance.
机译:我们在结合速率特定模型进行解码以补偿语音速率变化的框架下,研究了针对大词汇量会话语音识别的语音速率相关声学模型的几种变体。我们研究了两种结合速率特定模型的基本方法:一种是在语音级别结合模型,另一种是在HMM状态级别结合模型。此外,我们研究了不同数量的语音速率类和不同的参数绑定方案的影响。使用SRI的DECIPHER识别系统在Switchboard数据库上进行的实验表明,与速率无关的声学模型导致在与速率无关的基准上相对词错误率降低了2%,并且语音级别约束,特定于速率之间的高斯共享模型,以及良好的语速等级选择对于确保最佳性能都非常重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号