首页> 外文会议> >Duration normalization for improved recognition of spontaneous and read speech via missing feature methods
【24h】

Duration normalization for improved recognition of spontaneous and read speech via missing feature methods

机译:持续时间归一化,通过缺失特征方法改善对自发和阅读语音的识别

获取原文
获取外文期刊封面目录资料

摘要

Hidden Markov models (HMMs) are known to model the duration of sound units poorly. We present a technique to normalize the duration of each phone to overcome this weakness, with the conjecture that speech with normalized phone durations may be better modeled and discriminated using standard HMM acoustic models. Duration normalization is accomplished by dropping frames if a phone is longer than the desired duration and by adding "missing" frames and reconstructing them if a phone is shorter than the desired duration. If phone segmentations are known a priori, we achieve a 15.8% reduction in relative word error rate (WER) on spontaneous speech and a 10.3% reduction in relative WER on read speech. Preliminary work with automatic phone segmentations derived from the data is also presented.
机译:众所周知,隐马尔可夫模型(HMM)很难对声音单位的持续时间进行建模。我们提出了一种标准化每个电话的持续时间以克服此弱点的技术,并推测可以使用标准HMM声学模型更好地建模和区分具有标准化电话持续时间的语音。如果电话的长度比期望的持续时间长,则通过丢弃帧来实现持续时间的归一化;如果电话的长度比期望的持续时间短,则通过添加“丢失”帧并对其进行重构来实现持续时间归一化。如果先验地知道电话细分,那么自发语音的相对单词错误率(WER)降低了15.8%,而阅读语音的相对WER降低了10.3%。还介绍了根据数据自动进行电话细分的初步工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号