Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition

Xianyu Zhao; Zhijian Ou

首页> 外文期刊>IEEE transactions on audio, speech and language processing >Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition

【24h】

Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition

机译：麦克风阵列语音识别的紧密耦合阵列处理和基于模型的补偿

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In conventional microphone array speech recognition, the array processor and the speech recognizer are loosely coupled. The only connection between the two modules is the enhanced target signal output from the array processor, which then gets treated as a single input to the recognizer. In this approach, useful environmental information, which can be provided by the array processor and also needs to be exploited by the recognizer, is ignored. Inherently, the array processor can generate multiple outputs of spatially filtered signals, as a multi-input-multi-output (MIMO) module. In this paper, a closely coupled approach is proposed, in which a recognizer with model-based noise compensation exploits the reference noise outputs from a MIMO array processor. Specifically, a multichannel model-based noise compensation is presented, including the compensation procedure using the vector Taylor series (VTS) expansion and parameter estimation using the expectation-maximization (EM) algorithm. It is also shown how to construct MIMO array processors from conventional beamformers. A number of practical implementations of the conventional loosely coupled approach and the proposed closely coupled approach were tested on a publicly available database, the Multichannel Overlapping Number Corpus (MONC). Experimental results showed that the proposed closely coupled approach significantly improved the speech recognition performance in the overlapping speech situations

机译：在传统的麦克风阵列语音识别中，阵列处理器和语音识别器是松散耦合的。这两个模块之间的唯一连接是从阵列处理器输出的增强目标信号，然后将其视为识别器的单个输入。在这种方法中，可以由阵列处理器提供并且也需要识别器利用的有用环境信息被忽略。作为一个多输入多输出（MIMO）模块，阵列处理器可以固有地生成空间滤波信号的多个输出。在本文中，提出了一种紧密耦合的方法，其中具有基于模型的噪声补偿的识别器利用MIMO阵列处理器的参考噪声输出。具体而言，提出了一种基于多通道模型的噪声补偿，包括使用矢量泰勒级数（VTS）展开的补偿过程和使用期望最大化（EM）算法的参数估计。还显示了如何从常规波束形成器构建MIMO阵列处理器。在公开可用的数据库多通道重叠数字语料库（MONC）上测试了常规松散耦合方法和建议的紧密耦合方法的许多实际实现。实验结果表明，在语音重叠的情况下，所提出的紧密耦合方法显着提高了语音识别性能。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2007年第3期|p.1114-1122|共9页
作者
Xianyu Zhao; Zhijian Ou;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
array signal processing; expectation-maximisation algorithm; filtering theory; microphone arrays; signal denoising; speech recognition; MIMO array processor; closely coupled array processing; enhanced target signal output; expectation-maximization algorithm; micro;

机译：阵列信号处理;期望最大化算法;滤波理论;麦克风阵列;信号去噪;语音识别;MIMO阵列处理器;紧密耦合阵列处理;增强目标信号输出;期望最大化算法;微;

相似文献

外文文献
中文文献
专利

1. Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors [J] . Kumatani K., Mcdonough J., Raj B. Signal Processing Magazine, IEEE . 2012,第6期

机译：远距离语音识别的麦克风阵列处理：从近距离麦克风到远场传感器
2. Adaptive microphone array processing for high-performance speech recognition in car environment [J] . Consumer Electronics, IEEE Transactions on . 2011,第1期

机译：自适应麦克风阵列处理，可在汽车环境中实现高性能语音识别
3. Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN [J] . Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa EURASIP journal on applied signal processing . 2006,第20期

机译：通过将多个麦克风阵列处理与位置相关的CMN相结合，实现鲁棒的远程语音识别
4. Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition [C] . Xianyu Zhao, Zhijian Ou, Minhua Chen, . 2005

机译：麦克风阵列语音识别的紧密耦合阵列处理和基于模型的补偿
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. A Real-Time Speech Separation Method Based on Camera and Microphone Array Sensors Fusion Approach [O] . Ching-Feng Liu, Wei-Siang Ciou, Peng-Ting Chen, 2020

机译：基于摄像头和麦克风阵列传感器融合方法的实时语音分离方法
7. Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors [O] . Kenichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto, 2012

机译：用于远距离语音识别的麦克风阵列处理：从近距离麦克风到远场传感器

Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅