A spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments

Erik Visser; Manabu Otsuka; Te-Won Lee


Abstract

A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial processing stage. Then denoising of distributed background noise is achieved in a combined spatial/temporal processing approach. The desired speaker signal is first processed along with an artificially constructed noise signal in a supplementary blind source separation step. It is further denoised by exploiting differences in temporal speech and noise statistics in a wavelet filterbank. The scheme's performance is illustrated by speech recognition experiments on real recordings in a noisy car environment. In comparison to a common multi-microphone technique like beamforming with spectral subtraction, the scheme is shown to enable more accurate speech recognition in the presence of a highly interfering point source and strong background noise.
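The spatial processing stage described above can be illustrated with a minimal sketch of two-microphone blind source separation. It rests on several assumptions that are not from the paper: the mixing is instantaneous (real car recordings are convolutive), the sources are synthetic stand-ins rather than speech, and a generic FastICA iteration is swapped in because the abstract does not say which separation algorithm the authors used. All function and variable names (`whiten`, `sym_decorrelate`, `fastica`, `mixing`) are hypothetical.

```python
import numpy as np

def whiten(x):
    """Center and whiten a (channels, samples) array."""
    x = x - x.mean(axis=1, keepdims=True)
    eigvals, eigvecs = np.linalg.eigh(np.cov(x))
    return eigvecs @ np.diag(eigvals ** -0.5) @ eigvecs.T @ x

def sym_decorrelate(w):
    """Symmetric orthogonalization: W <- (W W^T)^(-1/2) W."""
    eigvals, eigvecs = np.linalg.eigh(w @ w.T)
    return eigvecs @ np.diag(eigvals ** -0.5) @ eigvecs.T @ w

def fastica(x, n_iter=200, tol=1e-8, seed=0):
    """Generic symmetric FastICA with a tanh nonlinearity (illustrative only)."""
    z = whiten(x)
    rng = np.random.default_rng(seed)
    n_ch = z.shape[0]
    w = sym_decorrelate(rng.standard_normal((n_ch, n_ch)))
    for _ in range(n_iter):
        y = w @ z
        g = np.tanh(y)
        g_prime = 1.0 - g ** 2
        # Fixed-point update: E[g(y) z^T] - diag(E[g'(y)]) W
        w_new = (g @ z.T) / z.shape[1] - np.diag(g_prime.mean(axis=1)) @ w
        w_new = sym_decorrelate(w_new)
        converged = np.max(np.abs(np.abs(np.diag(w_new @ w.T)) - 1.0)) < tol
        w = w_new
        if converged:
            break
    return w @ z

# Synthetic stand-ins (assumptions): a heavy-tailed "speech-like" source
# and a point-source interferer, observed at two microphones.
rng = np.random.default_rng(1)
n = 20000
sources = np.vstack([rng.laplace(size=n), rng.uniform(-1.0, 1.0, size=n)])
mixing = np.array([[1.0, 0.6], [0.5, 1.0]])  # hypothetical instantaneous mixing
mics = mixing @ sources

estimates = fastica(mics)

# Absolute correlation between each true source (rows) and each estimate (columns).
corr = np.abs(np.corrcoef(np.vstack([sources, estimates]))[:2, 2:])
print(np.round(corr, 3))
```

Separation is only defined up to permutation and sign, which is why the check uses absolute correlations rather than comparing waveforms directly. The later stages mentioned in the abstract (the supplementary separation step with a constructed noise signal and the wavelet filterbank denoising) are not covered by this sketch.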

Bibliographic details

Source
Speech Communication, 2003, No. 3, pp. 393-407 (15 pages)
Authors
Erik Visser; Manabu Otsuka; Te-Won Lee
Author affiliations

Institute for Neural Computation, University of California, San Diego, 9500 Gilman Drive, Dept 0523, La Jolla, CA 92093-0523, USA;

DENSO Corporation, Research Laboratories, 500-1 Minamiyama Komenoki, Nisshin Aichi 470-0111, Japan;

Institute for Neural Computation, University of California, San Diego, 9500 Gilman Drive, Dept 0523, La Jolla, CA 92093-0523, USA;

Indexing information
Original format: PDF
Language of text: English
Chinese Library Classification: Language and writing
Keywords
speech enhancement; robust speech recognition; blind source separation; noisy environments

Similar literature

1. An effective cluster-based model for robust speech detection and speech recognition in noisy environments [J]. Gorriz JM, Ramirez J, Segura JC. The Journal of the Acoustical Society of America, 2006, No. 1.
2. Robust speech/non-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments [J]. Arnaud Martin, Laurent Mauuary. Speech Communication, 2006, No. 2.
3. Auditory processing of speech signals for robust speech recognition in real-world noisy environments [J]. Doh-Suk Kim, Soo-Young Lee. IEEE Transactions on Speech and Audio Processing, 1999, No. 1.
4. Robust Recognition of Noisy Speech Using Speech Enhancement [C]. Xu Yifang, Zhang Jinjie, Yao Kaisheng. 16th World Computer Congress 2000 and 2000 5th International Conference on Signal Processing, Proceedings Vol. II, August 21-25, 2000, Beijing, China. 2000.
5. Evaluation of speech enhancement techniques for speaker recognition in noisy environments [D]. El-Solh, Abdel-Aziz. 2006.
6. Robust EEG-Based Decoding of Auditory Attention With High-RMS-Level Speech Segments in Noisy Conditions [O]. Lei Wang, Ed X. Wu, Fei Chen. 2020.
7. Speech enhancement strategy for speech recognition microcontroller under noisy environments [O]. Chan KY, Nordholm S, Yiu KFC. 2013.
