Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features–A Theoretically Consistent Approach

Jensen J.; Tan Z.-H.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features–A Theoretically Consistent Approach

【24h】

Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features–A Theoretically Consistent Approach

机译：频率倒谱特征的最小均方误差估计-理论上一致的方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, we consider the problem of feature enhancement for noise-robust automatic speech recognition (ASR). We propose a method for minimum mean-square error (MMSE) estimation of mel-frequency cepstral features, which is based on a minimum number of well-established, theoretically consistent statistical assumptions. More specifically, the method belongs to the class of methods relying on the statistical framework proposed in Ephraim and Malah’s original work (“Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, 1984). The method is general in that it allows MMSE estimation of mel-frequency cepstral coefficients (MFCC’s), cepstral-mean subtracted (CMS-) MFCC’s, autoregressive-moving-average (ARMA)-filtered CMS-MFCC’s, velocity, and acceleration coefficients. In addition, the method is easily modified to take into account other compressive non-linearities than the logarithm traditionally used for MFCC computation. In terms of MFCC estimation performance, as measured by MFCC mean-square error, the proposed method shows performance which is identical to or better than other state-of-the-art methods. In terms of ASR performance, no statistical difference could be found between the proposed method and the state-of-the-art methods. We conclude that existing state-of-the-art MFCC feature enhancement algorithms within this class of algorithms, while theoretically suboptimal or based on theoretically inconsistent assumptions, perform close to optimally in the MMSE sense.

机译：在这项工作中，我们考虑了抗噪自动语音识别（ASR）的功能增强问题。我们提出了一种方法，该方法基于最低限度的成熟的，理论上一致的统计假设，来估计梅尔频率倒谱特征的最小均方误差（MMSE）。更具体地说，该方法属于依赖于Ephraim和Malah的原始工作（“使用最小均方误差短时频谱幅度估计器的语音增强”，IEEE Trans。Acoust。，语音，信号处理，第ASSP-32卷，第6号，1984年）。该方法具有通用性，因为它允许MMSE估计梅尔频率倒谱系数（MFCC），倒谱均值（CMS-）MFCC，自回归移动平均值（ARMA）滤波的CMS-MFCC，速度和加速度系数。另外，该方法易于修改，以考虑到除传统上用于MFCC计算的对数以外的其他压缩非线性。就MFCC估计性能而言，通过MFCC均方误差测量，所提出的方法显示出与其他最新技术相同或更好的性能。在ASR性能方面，建议的方法与最新方法之间没有统计差异。我们得出的结论是，此类算法中现有的最先进的MFCC特征增强算法，尽管在理论上不是最佳选择或基于理论上不一致的假设，但在MMSE方面的表现接近最佳。

著录项

来源
《Audio, Speech, and Language Processing, IEEE/ACM Transactions on》 |2015年第1期|186-197|共12页
作者
Jensen J.; Tan Z.-H.;
展开▼
作者单位

Aalborg University, Aalborg, Denmark;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Estimation; Mean square error methods; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; Robust automatic speech recognition (ASR); mel-frequency cepstral coefficient (MFCC); minimum mean-square error (MMSE) estimation; speech enhancement;

机译：估计;均方误差方法;梅尔倒谱系数;噪声;噪声测量;语音;稳健自动语音识别（ASR）;梅尔倒谱系数（MFCC）;最小均方误差（MMSE）估计;语音增强;

相似文献

外文文献
中文文献
专利

1. Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model [J] . Indrebo K.M., Povinelli R.J., Johnson M.T. IEEE transactions on audio, speech and language processing . 2008,第8期

机译：使用新型失真模型的梅尔频率倒谱系数的最小均方误差估计
2. Vocal Fold Pathology Assessment Using Mel-Frequency Cepstral Coefficients and Linear Predictive Cepstral Coefficients Features [J] . Jennifer C. Saldanha, T. Ananthakrishna, Rohan Pinto Journal of Medical Imaging and Health Informatics . 2014,第2期

机译：使用Mel频率倒谱系数和线性预测倒谱系数功能进行人声折叠病理评估
3. Bayesian Minimum Mean-Square Error Estimation for Classification Error—Part I: Definition and the Bayesian MMSE Error Estimator for Discrete Classification [J] . Dalton L.A., Dougherty E.R. Signal Processing, IEEE Transactions on . 2011,第1期

机译：分类误差的贝叶斯最小均方误差估计—第一部分：离散分类的定义和贝叶斯MMSE误差估计器
4. A theoretically consistent method for minimum mean-square error estimation of mel-frequency cepstral features [C] . Jensen Jens, Zheng-Hua Tan IEEE International Conference on Network Infrastructure and Digital Content . 2014

机译：梅尔频率倒谱特征最小均方误差估计的理论上一致的方法
5. Mean Square Error Analysis in Orthogonal Frequency Division Multiplexing Systems for Least Square and Minimum Mean Square Error Channel Estimation [D] . Thote, Ketaki Avinash. 2018

机译：正交频分复用系统中最小二乘和最小均方误差信道估计的均方误差分析
6. Augmented GNSS Differential Corrections Minimum Mean Square Error Estimation Sensitivity to Spatial Correlation Modeling Errors [O] . Nazelie Kassabian, Letizia Lo Presti, Francesco Rispoli 2014

机译：增强的GNSS微分校正最小均方误差估计对空间相关建模误差的敏感性
7. Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model [O] . K.M. Indrebo, R.J. Povinelli, M.T. Johnson 2008

机译：使用新型失真模型的熔融谱系系数的最小平均平均误差估计

Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features–A Theoretically Consistent Approach

摘要

著录项

相似文献

相关主题

期刊订阅