首页> 外文会议> >A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition
【24h】

A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition

机译:基于音素的连续语音识别的半连续随机轨迹模型

获取原文
获取外文期刊封面目录资料

摘要

We propose a model of phoneme-based speech unit, called semi-continuous stochastic trajectory model (SC-STM), which generalizes our stochastic trajectory models (STM). As STMs, the SC-STMs focus on the modeling of speech segments (called trajectories) in their parameter space, and can therefore handle segmental information, which is critical for large vocabulary continuous speech recognition. Compared to the STMs, the SC-STMs improve the resolution of the trajectory modeling, while keeping a moderate number of free parameters by sharing state probability density functions. The SC-STM can therefore maintain a good trade-off between detailed acoustic modeling and limited training data. We tested the idea on a 2010 words, speaker-dependent, continuous speech database. Preliminary results show that SC-STM gives a word accuracy close to that of STM, without using heuristic techniques that enhanced STM.
机译:我们提出了一种基于音素的语音单元模型,称为半连续随机轨迹模型(SC-STM),该模型概括了我们的随机轨迹模型(STM)。作为STM,SC-STM专注于在其参数空间中对语音片段(称为轨迹)进行建模,因此可以处理片段信息,这对于大词汇量连续语音识别至关重要。与STM相比,SC-STM改进了轨迹建模的分辨率,同时通过共享状态概率密度函数保持适量的自由参数。因此,SC-STM可以在详细的声学模型和有限的训练数据之间保持良好的权衡。我们在2010单词,与说话者相关的连续语音数据库中测试了该想法。初步结果表明,在不使用增强STM的启发式技术的情况下,SC-STM的字准确度接近STM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号