FILTERING ON THE TEMPORAL PROBABILITY SEQUENCE IN HISTOGRAM EQUALIZATION FOR ROBUST SPEECH RECOGNITION

机译：滤除鲁棒语音识别直方图均衡中的时间概率序列

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a filter-based histogram equalization (FHEQ) approach for robust speech recognition. The FHEQ approach first represents the original acoustic feature sequence with statistic probability. Then, a temporal average (TA) filter is applied to smooth the statistic probability sequence. Finally, the filtered statistic probability sequence is transformed to form a new acoustic feature stream. Filtering on statistic probability of a feature sequence is a novel concept that can incorporate the advantages of the conventional histogram equalization (HEQ) and temporal filtering techniques for better noise robustness. Our experimental results on the Aurora-2 and Aurora-4 tasks show that FHEQ outperforms the conventional cepstral mean subtraction (CMS), cepstral mean and variance normalization (CMVN), and HEQ. Furthermore, we conducted a comparison test on TA-HEQ and HEQ-TA, which apply a TA filter to smooth acoustic features before and after the HEQ processing, respectively. The test results show that FHEQ outperforms both TA-HEQ and HEQ-TA, suggesting that filtering in probability is more effective than filtering in acoustic feature.

机译：在本文中，我们提出了一种基于滤波器的直方图均衡（FHEQ）方法，用于鲁棒语音识别。 FHEQ方法首先表示具有统计概率的原始声学特征序列。然后，应用时间平均值（TA）滤波器以平滑统计概率序列。最后，转换过滤的统计概率序列以形成新的声学特征流。过滤特征序列的统计概率是一种新颖的概念，可以包含传统直方图均衡（HEQ）和时间过滤技术的优点，以获得更好的噪声鲁棒性。我们对极光-2和极光-4任务的实验结果表明，FHEQ优于传统的抗搏斯平均减法（CMS），抗康斯兰均值和方差标准化（CMVN）和HEQ。此外，我们对TA-HEQ和HEQ-TA进行了比较测试，其将TA过滤器应用于HEQ处理之前和之后的光滑声学特征。测试结果表明，FHEQ优于TA-HEQ和HEQ-TA，表明概率滤波比声学特征的滤波更有效。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|2013年||共5页
会议地点
作者
Syu-Siang Wang; Yu Tsao; Jeih-weih Hung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Histogram equalization of contextual statistics of speech features for robust speech recognition [J] . Hsieh Hsin-Ju, Chen Berlin, Hung Jeih-weih Multimedia Tools and Applications . 2015,第17期

机译：语音特征语境统计的直方图均衡化，可增强语音识别能力
2. Histogram equalization of speech representation for robust speech recognition [J] . de la Torre A., Peinado A.M., Segura J.C., IEEE Transactions on Speech and Audio Proceessing . 2005,第3期

机译：语音表示的直方图均衡化，可增强语音识别能力
3. Histogram equalization with Bayesian estimation for noise robust speech recognition [J] . Suh Youngjoo, Kim Hoirin The Journal of the Acoustical Society of America . 2018,第2期

机译：贝叶斯估计噪声鲁棒语音识别的直方图均衡
4. Filtering on the temporal probability sequence in histogram equalization for robust speech recognition [C] . Wang Syu-Siang, Tsao Yu, Hung Jeih-weih IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：对直方图均衡中的时间概率序列进行滤波以实现鲁棒的语音识别
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition [O] . Sanaz Seyedin, Seyed Mohammad Ahadi, Saeed Gazor 2013

机译：使用滤波自相关序列的鲁棒MVDR频谱进行鲁棒语音识别的新功能
7. Front-End Post-Processing Using Histogram Equalization Combined with ARMA Filtering for Noise Robust Speech Recognition [O] . Shariati Seyedeh Saloomeh, Ahadi Mohammad, Mohammadi Karim 2007

机译：直方图均衡与ARMA滤波相结合的前端后处理，用于噪声鲁棒的语音识别

FILTERING ON THE TEMPORAL PROBABILITY SEQUENCE IN HISTOGRAM EQUALIZATION FOR ROBUST SPEECH RECOGNITION

摘要

著录项

相似文献

相关主题

期刊订阅