...
【24h】

MVA Processing of Speech Features

机译:语音特征的MVA处理

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we investigate a technique consisting of mean subtraction, variance normalization and time sequence filtering. Unlike other techniques, it applies auto-regression moving-average (ARMA) filtering directly in the cepstral domain. We call this technique mean subtraction, variance normalization, and ARMA filtering (MVA) post-processing, and speech features with MVA post-processing are called MVA features. Overall, compared to raw features without post-processing, MVA features achieve an error rate reduction of 45% on matched tasks and 65% on mismatched tasks on the Aurora 2.0 noisy speech database, and an average 57% error reduction on the Aurora 3.0 database. These improvements are comparable to the results of much more complicated techniques even though MVA is relatively simple and requires practically no additional computational cost. In this paper, in addition to describing MVA processing, we also present a novel analysis of the distortion of mel-frequency cepstral coefficients and the log energy in the presence of different types of noise. The effectiveness of MVA is extensively investigated with respect to several variations: the configurations used to extract and the type of raw features, the domains where MVA is applied, the filters that are used, the ARMA filter orders, and the causality of the normalization process. Specifically, it is argued and demonstrated that MVA works better when applied to the zeroth-order cepstral coefficient than to log energy, that MVA works better in the cepstral domain, that an ARMA filter is better than either a designed finite impulse response filter or a data-driven filter, and that a five-tap ARMA filter is sufficient to achieve good performance in a variety of settings. We also investigate and evaluate a multi-domain MVA generalization
机译:在本文中,我们研究了一种由均值减法,方差归一化和时序过滤组成的技术。与其他技术不同,它直接在倒频谱域中应用自回归移动平均(ARMA)滤波。我们称此技术为减法,方差归一化和ARMA滤波(MVA)后处理,而具有MVA后处理的语音功能称为MVA功能。总体而言,与没有后处理的原始功能相比,MVA功能在Aurora 2.0嘈杂语音数据库上对匹配任务的错误率降低了45%,对不匹配任务的错误率降低了65%,在Aurora 3.0数据库上平均降低了57% 。即使MVA相对简单并且实际上不需要额外的计算成本,这些改进也可以与复杂得多的技术的结果进行比较。在本文中,除了描述MVA处理之外,我们还提出了在存在不同类型噪声的情况下对mel频率倒谱系数的失真和对数能量的新颖分析。广泛研究了MVA的有效性,涉及以下几个方面:用于提取的配置和原始特征的类型,应用MVA的域,使用的过滤器,ARMA过滤器的阶数以及归一化过程的因果关系。具体而言,有论证和论证表明,将MVA应用于零阶倒频谱系数比记录能量要好,MVA在倒频谱域中效果更好,ARMA滤波器比设计的有限脉冲响应滤波器或AVA滤波器都好。数据驱动的滤波器,而一个五抽头的ARMA滤波器足以在各种设置下实现良好的性能。我们还将研究和评估多域MVA概括

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号