首页> 外文会议> >Duration normalization for improved recognition of spontaneous and read speech via missing feature methods

【24h】

Duration normalization for improved recognition of spontaneous and read speech via missing feature methods

机译：持续时间归一化，通过缺失特征方法改善对自发和阅读语音的识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Hidden Markov models (HMMs) are known to model the duration of sound units poorly. We present a technique to normalize the duration of each phone to overcome this weakness, with the conjecture that speech with normalized phone durations may be better modeled and discriminated using standard HMM acoustic models. Duration normalization is accomplished by dropping frames if a phone is longer than the desired duration and by adding "missing" frames and reconstructing them if a phone is shorter than the desired duration. If phone segmentations are known a priori, we achieve a 15.8% reduction in relative word error rate (WER) on spontaneous speech and a 10.3% reduction in relative WER on read speech. Preliminary work with automatic phone segmentations derived from the data is also presented.

机译：众所周知，隐马尔可夫模型（HMM）很难对声音单位的持续时间进行建模。我们提出了一种标准化每个电话的持续时间以克服此弱点的技术，并推测可以使用标准HMM声学模型更好地建模和区分具有标准化电话持续时间的语音。如果电话的长度比期望的持续时间长，则通过丢弃帧来实现持续时间的归一化;如果电话的长度比期望的持续时间短，则通过添加“丢失”帧并对其进行重构来实现持续时间归一化。如果先验地知道电话细分，那么自发语音的相对单词错误率（WER）降低了15.8％，而阅读语音的相对WER降低了10.3％。还介绍了根据数据自动进行电话细分的初步工作。

著录项

来源
《》|2001年|P.313-316|共4页
会议地点
作者
Nedel; J.P.; Stern; R.M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition [J] . Wooil Kim, Hansen J.H.L. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：一种基于后验的代表性均值估计的新的掩模估计方法用于特征缺失语音识别
2. Feature Extraction Method for Improving Speech Recognition in Noisy Environments [J] . Youssef Zouhir, Kaies Ouni Journal of computer sciences . 2016,第2期

机译：噪声环境下提高语音识别能力的特征提取方法
3. Feature Extraction Method for Improving Speech Recognition in Noisy Environments | Science Publications [J] . Ka?s Ouni, Youssef Zouhir Journal of computer sciences . 2016,第2期

机译：噪声环境下提高语音识别性能的特征提取方法科学出版物
4. Duration normalization for improved recognition of spontaneous and read speech via missing feature methods [C] . Jon P. Nedel, Richard M. Stern IEEE International Conference on Acoustics, Speech, and Signal Processing . 2001

机译：持续时间归一化，用于通过缺少的特征方法改进自发性和读取语音的识别
5. Duration normalization for robust recognition of spontaneous speech via missing feature methods. [D] . Nedel, Jon P. 2004

机译：持续时间归一化，可通过缺失特征方法对自发语音进行可靠识别。
6. On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition [O] . Juraj Kacur, Boris Puterka, Jarmila Pavlovicova, 2021

机译：语音情感识别中的语音特性和特征提取方法
7. Duration Normalization For Improved Recognition Of Spontaneous And Read Speech Via Missing Feature Methods [O] . Jon P. Nedel, Richard M. Stern 2001

机译：持续时间归一化，通过缺失特征方法改进对自发和朗读语音的识别

Duration normalization for improved recognition of spontaneous and read speech via missing feature methods

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅