Frequency-domain maximum likelihood estimation for automatic speechrecognition in additive and convolutive noises

Zhao Y.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Frequency-domain maximum likelihood estimation for automatic speechrecognition in additive and convolutive noises

【24h】

Frequency-domain maximum likelihood estimation for automatic speechrecognition in additive and convolutive noises

机译：累加和卷积噪声中自动语音识别的频域最大似然估计

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A feature estimation technique is proposed for speech signals that are degraded by both additive and convolutive noises. An EM algorithm is formulated in the frequency-domain for identification of the magnitude response of the distortion channel and power spectrum of additive noise, and posterior estimates of short-time power spectra of speech are obtained based on the identified channel and noise. The estimated posterior power spectra are used to calculate perceptually-based linear prediction cepstral coefficients, and the estimated cepstral features and their temporal regression coefficients are used for automatic speech recognition using acoustic models trained from clean speech. Experiments were performed on speaker independent continuous speech recognition, where the speech data were taken from the TIMIT database and were degraded by a distortion channel and simulated additive noises with white or colored spectral characteristics at various SNR levels. Experimental results indicate that the proposed technique leads to convergent identification of channel and noise and significantly improved recognition accuracy for speaker-independent continuous speech

机译：提出了一种针对语音信号的特征估计技术，该语音信号会由于加性和卷积性噪声而退化。在频域中提出了一种EM算法，用于识别失真通道的幅度响应和加性噪声的功率谱，并基于识别出的通道和噪声获得语音的短期功率谱的后验估计。估计的后验功率谱用于计算基于感知的线性预测倒谱系数，并且估计的倒谱特征及其时间回归系数用于使用从纯语音训练的声学模型进行自动语音识别。对说话者独立的连续语音识别进行了实验，其中语音数据来自TIMIT数据库，并通过失真通道和模拟的加性噪声在各种SNR级别具有白色或彩色频谱特性而退化。实验结果表明，所提出的技术可以对信道和噪声进行收敛性识别，并大大提高了与说话者无关的连续语音的识别精度

著录项

来源
《IEEE Transactions on Speech and Audio Proceessing》 |2000年第3期|p.255-266|共12页
作者
Zhao Y.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
cepstral analysis; convolution; frequency-domain analysis; maximum likelihood estimation; optimisation; prediction theory; speech recognition; telecommunication channels; white noise; EM algorithm; SNR levels; TIMIT database; acoustic models; additive noise; automatic;

机译：倒频谱分析;卷积;频域分析;最大似然估计;优化;预测理论;语音识别;电信信道;白噪声;EM算法;SNR级别;TIMIT数据库;声学模型;加性噪声;自动;
入库时间 2022-08-18 00:13:23

相似文献

外文文献
中文文献
专利

1. Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises [J] . Zhao Y. IEEE Transactions on Speech and Audio Proceeding . 2000,第3期

机译：用于加性和卷积噪声中自动语音识别的频域最大似然估计
2. Maximum likelihood DOA estimation and asymptotic Cramer-Rao bounds for additive unknown colored noise [J] . Hao Ye, DeGroat D. IEEE Transactions on Signal Processing . 1995,第4期

机译：附加未知色噪声的最大似然DOA估计和渐近Cramer-Rao边界
3. Parameter identification of fluid line networks by frequency-domain maximum likelihood estimation [J] . Aaron C. Zecchin, Langford B. White, Martin F. Lambert, Mechanical systems and signal processing . 2013,第1a2期

机译：基于频域最大似然估计的流体管路网络参数识别
4. THE MINIMUM NUMBER OF SCANNING WINDOWS REQUIRED FOREFFECTIVE MAXIMUM LIKELIHOOD ESTIMATION OF IMAGE TEXTUREPARAMETERS AND ADDITIVE NOISE VARIANCE [C] . Mikhail Uss, Benoit Vozel, Kacem. Chehdi, International Kharkov Symposium on Physics and Engineering of Microwaves, Millimeter and Submillimeter Waves . 2010

机译：有效最大似然估计的图像纹理参数和附加噪声方差所需的最小扫描窗口数
5. Maximum likelihood estimation of exponentials in unknown colored noise for target identification in synthetic aperture radar images. [D] . Pepin, Matthew Peter. 1996

机译：用于合成孔径雷达图像中目标识别的未知彩色噪声中指数的最大似然估计。
6. Microarray background correction: maximum likelihood estimation for the normal–exponential convolution [O] . Jeremy D. Silver, Matthew E. Ritchie, Gordon K. Smyth -1

机译：芯片背景校正：正态-指数卷积的最大似然估计
7. Effect of pulse noise jamming and phase noise on a coherent RAKE receiver with maximum-likelihood detection and convolutional coding [O] . K. Kowalske, R.C. Robertson -1

机译：脉冲噪声干扰和相位噪声对具有最大似然检测和卷积编码的相干RAKE接收机的影响

Frequency-domain maximum likelihood estimation for automatic speechrecognition in additive and convolutive noises

摘要

著录项

相似文献

相关主题

期刊订阅