Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition

Ji-Won Cho; Hyung-Min Park

Abstract

This paper presents a feature-enhancement method that uses the outputs of independent vector analysis (IVA) for robust speech recognition. Although frequency-domain (FD) independent component analysis (ICA) can serve as a preprocessing step for speech recognition thanks to its blind source separation (BSS) capability, the performance of conventional ICA-based approaches degrades significantly in under-determined cases. Assuming the target speaker is located relatively close to the microphones, the blind spatial subtraction array (BSSA) (Takahashi et al.) tries to enhance target speech features by subtracting noise spectra estimated by FD ICA, even in under-determined cases. Unfortunately, ICA may estimate the target speech poorly, which in turn may make the noise spectrum estimate inaccurate. To make speech recognition more robust to these inaccurate noise spectra, we introduce Bayesian inference to estimate clean speech features. As a further improvement, FD ICA and delay-and-sum beamforming in the BSSA are replaced with IVA and its target speech output, because IVA can improve separation performance without the permutation problem. Experimental results show that, compared to the BSSA, the proposed method achieves average relative word error rate reductions of 60.11% and 20.07% on the AURORA2 and DARPA Resource Management databases, respectively.
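
The noise-subtraction idea mentioned in the abstract can be illustrated with a minimal sketch of generic power spectral subtraction. This is not the authors' BSSA or Bayesian method: the function name `spectral_subtraction`, the over-subtraction factor `alpha`, the `floor` parameter, and the toy spectra are all assumptions made for illustration only.

```python
from typing import List


def spectral_subtraction(
    noisy_power: List[float],
    noise_power: List[float],
    alpha: float = 1.0,   # hypothetical over-subtraction factor
    floor: float = 0.01,  # hypothetical spectral floor, as a fraction of the noisy power
) -> List[float]:
    """Subtract an estimated noise power spectrum from a noisy one, bin by bin.

    Bins where the noise estimate exceeds the observation are clamped to a
    small fraction of the noisy power so the result never becomes negative.
    """
    if len(noisy_power) != len(noise_power):
        raise ValueError("spectra must have the same number of frequency bins")

    enhanced = []
    for y, n in zip(noisy_power, noise_power):
        residual = y - alpha * n
        enhanced.append(max(residual, floor * y))
    return enhanced


# Toy power spectra for one frame (made-up values, three frequency bins).
noisy = [4.0, 9.0, 1.0]
noise = [1.0, 2.0, 3.0]
print(spectral_subtraction(noisy, noise))  # [3.0, 7.0, 0.01]
```

In the third bin the noise estimate is larger than the observation, so the output falls back to the floor. An overestimated noise spectrum distorts the enhanced features in exactly this way, which is the kind of inaccuracy the abstract says the Bayesian inference step is meant to tolerate.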

Bibliographic details

Source: Signal processing, 2016, Issue 3, pp. 200-208 (9 pages)
Authors: Ji-Won Cho; Hyung-Min Park
Affiliation (both authors): Department of Electronic Engineering, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul, Republic of Korea
Original format: PDF
Language: English
Keywords: Robust speech recognition; Independent vector analysis; Feature enhancement; Bayesian inference
