An Amplitude Warping Approach to Intra-speaker Normalization for Speech Recognition

机译：用于语音识别的扬声器内归一化的幅度变形方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we present an intra-speaker warping factor estimation based on pitch alteration utterance. The feature space distributions of untrans-formed speech from the pitch alteration utterance of intra-speaker would vary due to the acoustic differences of speech produced by glottis and vocal tract. Therefore, it may be possible to determine the amplitude warping factor by calculating the inverse ratio of input to reference pitch. As the recognition results, the error rate is reduced from 0.4% to 2.3% for digit and word decoding.

机译：在本文中，我们提出了基于音调变化话语的扬声器内翘曲因子估计。由于声门和声道产生的语音声学差异，来自扬声器内音调变化发声的未转换语音的特征空间分布将发生变化。因此，有可能通过计算输入与参考音高的反比来确定幅度扭曲因数。作为识别结果，数字和单词解码的错误率从0.4％降低到2.3％。

著录项

来源
《International Conference on Computational Science and Its Applications - ICCSA 2003 Pt.2 May 18-21, 2003 Montreal, Canada》|2003年|p.639-645|共7页
会议地点 Montreal(CA);Montreal(CA)
作者
Kwang-Seok Hong;
展开▼
作者单位

School of Information and Communication Engineering and SITRC, Sungkyunkwan University, Suwon, Korea;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition [J] . Shingo YOSHIZAWA, Noboru HAYASAKA, Naoya WADA, IEICE Transactions on Information and Systems . 2004,第8期

机译：倒谱幅度范围归一化，用于噪声鲁棒语音识别
2. Normalization of the Speech Modulation Spectra for Robust Speech Recognition [J] . Xiong Xiao, Eng Siong Chng, Haizhou Li IEEE transactions on audio, speech and language processing . 2008,第8期

机译：语音调制谱的归一化以实现可靠的语音识别
3. Temporal Structure Normalization of Speech Feature for Robust Speech Recognition [J] . Xiao X., Chng E. S., Li H. IEEE signal processing letters . 2007,第7期

机译：语音特征的时态结构归一化，用于鲁棒语音识别
4. An Amplitude Warping Approach to Intra-speaker Normalization for Speech Recognition [C] . Kwang-Seok Hong International confernce on computational science and its applications . 2003

机译：语音识别中讲话归一化的幅度翘曲方法
5. Frequency warping by linear transformation, and vocal tract inversion for speaker normalization in automatic speech recognition. [D] . Panchapagesan, Sankaran. 2008

机译：通过线性变换实现的频率扭曲和声道反转，可在自动语音识别中实现说话人归一化。
6. One-against-All Weighted Dynamic Time Warping for Language-Independent and Speaker-Dependent Speech Recognition in Adverse Conditions [O] . Xianglilan Zhang, Jiping Sun, Zhigang Luo 2010

机译：不利条件下与语言无关和与说话者相关的语音识别的一对多加权动态时间规整
7. On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks [O] . John H.L. Hansen, Hynek Bořil 2018

机译：论语音，演讲者和语言识别任务中讲话者变异性和现实主义的问题

An Amplitude Warping Approach to Intra-speaker Normalization for Speech Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅