Distant Speech Recognition Using a Microphone Array Network

Alberto Yoshihiro NAKANO; Seiichi NAKAGAWA; Kazumasa YAMAMOTO


Abstract

In this work, spatial information consisting of the position and orientation angle of an acoustic source is estimated by an artificial neural network (ANN). The estimated position of a speaker in an enclosed space is used to refine the estimated time delays for a delay-and-sum beamformer, thus enhancing the output signal. The orientation angle, in turn, is used to restrict the lexicon used in the recognition phase, assuming that the speaker faces a particular direction while speaking. To compensate for the effect of the transmission channel inside a short frame analysis window, a new cepstral mean normalization (CMN) method based on a Gaussian mixture model (GMM) is investigated; it shows better performance than conventional CMN for short utterances. The performance of the proposed method is evaluated through Japanese digit/command recognition experiments.
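For orientation, the two baseline techniques named in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the speaker position is taken as already known (the paper estimates it with an ANN, which is not reproduced here), delays are rounded to whole samples, the speed of sound is assumed to be 343 m/s, and the normalization shown is conventional per-utterance CMN rather than the proposed GMM-based variant. All function and parameter names are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value, not from the paper


def steering_delays(source_pos, mic_positions, sample_rate):
    """Delay of each microphone, in whole samples, relative to the nearest one."""
    dists = [math.dist(source_pos, m) for m in mic_positions]
    nearest = min(dists)
    return [round((d - nearest) / SPEED_OF_SOUND * sample_rate) for d in dists]


def delay_and_sum(channels, delays):
    """Advance each channel by its delay, then average the aligned samples."""
    length = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[n + d] for ch, d in zip(channels, delays)) / len(channels)
        for n in range(length)
    ]


def cepstral_mean_normalization(frames):
    """Conventional CMN: subtract the utterance-level mean from every frame."""
    dim = len(frames[0])
    mean = [sum(f[k] for f in frames) / len(frames) for k in range(dim)]
    return [[f[k] - mean[k] for k in range(dim)] for f in frames]
```

A better position estimate changes only the output of `steering_delays`, which is where the abstract's refinement of the time delays would enter. Conventional CMN needs enough frames for the mean to be a reliable channel estimate, which is the short-utterance limitation the GMM-based method is said to address.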

Bibliographic details

Source: IEICE Transactions on Information and Systems, 2010, No. 9, pp. 2451-2462 (12 pages)
Authors: Alberto Yoshihiro NAKANO; Seiichi NAKAGAWA; Kazumasa YAMAMOTO
Affiliation (listed identically for all three authors): Department of Information and Computer Sciences, Toyohashi University of Technology, Toyohashi-shi, 441-8580 Japan
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language of text: English
Keywords: distant speech recognition; microphone array network; GMM-based CMN; speaker's position and orientation estimation
Date added: 2022-08-18 00:27:00

Similar literature

1. Distant Speech Recognition Using a Microphone Array Network [J]. Alberto Yoshihiro NAKANO, Seiichi NAKAGAWA, Kazumasa YAMAMOTO. IEICE Transactions on Information and Systems, 2010, No. 9.
2. Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors [J]. Kumatani K., McDonough J., Raj B. IEEE Signal Processing Magazine, 2012, No. 6.
3. Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN [J]. Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa. EURASIP Journal on Applied Signal Processing, 2006, No. 20.
4. Microphone array processing for distant speech recognition: Spherical arrays [C]. McDonough John, Kumatani Kenichi, Raj Bhiksha. 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012.
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition [D]. Zhang, Xianxian. 2005.
6. A Real-Time Speech Separation Method Based on Camera and Microphone Array Sensors Fusion Approach [O]. Ching-Feng Liu, Wei-Siang Ciou, Peng-Ting Chen. 2020.
7. Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors [O]. Kenichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto. 2012.
