Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition

机译：调制域中复数值声谱的空间直方图均衡化，用于噪声鲁棒的语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes to enhance the complex-valued acoustic spectrograms of speech signals via the technique of histogram equalization (HEQ) to produce noise-robust features for recognition. The presented method extends our previous work in the task of spectrogram enhancement and has two significant aspects. First, we process the real and imaginary parts of acoustic spectrograms separately, and therefore both of the corresponding magnitude and phase components can be enhanced implicitly. Second, we apply FIR filters to the intra-frame acoustic spectra to acquire the respective local structural statistics, which are subsequently employed to perform various types of HEQ on the acoustic spectrograms for robustifying the resulting speech features. All experiments were carried out on the Aurora-2 database and task. The performance of the presented methods was thoroughly tested and verified by comparisons with other well-known robustness methods, which reveals the capability of our methods in promoting the noise robustness of speech features.

机译：本文提出通过直方图均衡（HEQ）技术来增强语音信号的复数值声谱图，以产生用于识别的鲁棒性特征。提出的方法扩展了我们先前在频谱图增强任务中的工作，并具有两个重要方面。首先，我们分别处理声谱图的实部和虚部，因此可以隐式增强相应的幅度和相位分量。其次，我们将FIR滤波器应用于帧内声谱，以获取各自的局部结构统计信息，随后将其用于对声谱图执行各种类型的HEQ，以增强所得到的语音特征。所有实验均在Aurora-2数据库和任务上进行。通过与其他众所周知的鲁棒性方法进行比较，对所提出方法的性能进行了彻底的测试和验证，这揭示了我们的方法在增强语音特征的噪声鲁棒性方面的能力。

著录项

来源
《Asia-Pacific Signal and Information Processing Association Annual Summit and Conference》|2014年|1-6|共6页
会议地点
作者
Hsin-Ju Hsieh; Chen Berlin; Jeih-weih Hung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
FIR filters; modulation; speech recognition; speech synthesis; Aurora-2 database; FIR filters; complex-valued acoustic spectra; complex-valued acoustic spectrograms; histogram equalization; intraframe acoustic spectra; modulation domain; noise-robust features; noise-robust speech recognition; phase components; spatial histogram equalization; spectrogram enhancement; speech features; structural statistics; Acoustics; Decision support systems; Frequency modulation; Signal to noise ratio; Spectrogram; Speech;

机译：FIR滤波器;调制;语音识别;语音合成; Aurora-2数据库; FIR滤波器;复值声谱;复值声谱图;直方图均衡;帧内声谱;调制域;鲁棒特征;噪声鲁棒的语音识别;相位分量;空间直方图均衡;频谱图增强;语音特征;结构统计;声学;决策支持系统;频率调制;信噪比;频谱图;语音;

相似文献

外文文献
中文文献
专利

1. Intra-frame cepstral sub-band weighting and histogram equalization for noise-robust speech recognition [J] . Jeih-weih Hung, Hao-teng Fan EURASIP Journal on Audio, Speech, and Music Processing . 2013,第1期

机译：帧内倒谱子带加权和直方图均衡，用于噪声鲁棒的语音识别
2. Histogram equalization for noise-robust speech recognition using discrete-mixture HMMs [J] . Tetsuo Kosaka, Masaharu Katoh, Masaki Kohda Acoustical science and technology . 2008,第1期

机译：使用离散混合HMM的直方图均衡用于噪声鲁棒的语音识别
3. Histogram equalization for noise-robust speech recognition using discrete-mixture HMMs [J] . Masaharu Katoh, Masaki Kohda, Tetsuo Kosaka Acoustical science and technology . 2008,第1期

机译：使用离散混合HMM进行直方图均衡以实现鲁棒语音识别
4. Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition [C] . Hsin-Ju Hsieh, Chen Berlin, Jeih-weih Hung Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2014

机译：噪声稳健语音识别调制域复值声光谱的空间直方图均衡
5. Biologically-inspired noise-robust speech recognition for both man and machine. [D] . Skowronski, Mark D. 2004

机译：人机交互的生物启发式鲁棒语音识别。
6. Comparing auditory filter bandwidths spectral ripple modulation detection spectral ripple discrimination and speech recognition: Normal and impaired hearing [O] . Evelyn Davies-Venn, b), Peggy Nelson, -1

机译：比较听觉滤波器的带宽频谱纹波调制检测频谱纹波鉴别和语音识别：听力正常和受损
7. Intra-frame cepstral sub-band weighting and histogram equalization for noise-robust speech recognition [O] . Jeih-weih Hung, Hao-teng Fan 2013

机译：帧内倒谱子带加权和直方图均衡，用于噪声鲁棒的语音识别

Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅