Adaptive beamformer with vowel/consonant identification based on average vowel/consonant spectrum for noisy speech recognition

Masato NAKAYAMA; Takanobu NISHIURA; Hideki KAWAHARA

Abstract

A microphone array is an ideal candidate for capturing distant-talking speech. AMNOR (Adaptive Microphone-array for NOise Reduction) is an adaptive beamformer proposed by Kaneda et al. S-AMNOR, an AMNOR that uses a long-time speech spectrum, was also proposed by Okada et al. as a beamformer for speech capture. In this paper, we propose a new AMNOR with adaptive filters for vowels and consonants in order to improve signal-capturing performance. We also identify vowels and consonants automatically from the output signal of the AMNOR using a GMM (Gaussian Mixture Model). The proposed system is evaluated through experiments with a microphone array in a real room. At SNR = 10 dB, with speech arriving from 90 degrees and noise from 50 degrees, the proposed method achieved an ASR (Automatic Speech Recognition) performance of 65%, compared with 60% for the conventional method.
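
The vowel/consonant identification step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the features, model sizes, or decision rule, so the diagonal-covariance GMMs, the toy parameters, and the names `gmm_log_likelihood` and `identify_frame` below are all assumptions made for illustration. The sketch only shows the general idea of labeling a frame by comparing its likelihood under a vowel GMM and a consonant GMM.

```python
import numpy as np


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM.

    weights: (K,), means: (K, D), variances: (K, D). All hypothetical inputs.
    """
    x = np.asarray(x, dtype=float)
    # Per-component Gaussian log-density, shape (K,)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)
    log_exp = -0.5 * np.sum((x - means) ** 2 / variances, axis=1)
    log_comp = np.log(weights) + log_norm + log_exp
    # log-sum-exp over components for numerical stability
    m = np.max(log_comp)
    return m + np.log(np.sum(np.exp(log_comp - m)))


def identify_frame(x, vowel_gmm, consonant_gmm):
    """Label a frame by the GMM that gives the higher log-likelihood."""
    lv = gmm_log_likelihood(x, *vowel_gmm)
    lc = gmm_log_likelihood(x, *consonant_gmm)
    return "vowel" if lv >= lc else "consonant"


# Toy models (assumed values): (weights, means, variances), 2 components, 2-D features
vowel_gmm = (
    np.array([0.5, 0.5]),
    np.array([[2.0, 2.0], [3.0, 1.0]]),
    np.ones((2, 2)),
)
consonant_gmm = (
    np.array([0.5, 0.5]),
    np.array([[-2.0, -2.0], [-3.0, -1.0]]),
    np.ones((2, 2)),
)

# Two made-up feature vectors standing in for frames of the beamformer output
frames = [[2.5, 1.5], [-2.5, -1.5]]
labels = [identify_frame(f, vowel_gmm, consonant_gmm) for f in frames]
print(labels)
```

In a system like the one described, such a per-frame label would presumably decide whether the vowel filter or the consonant filter is applied; how the paper actually does this is not stated in the abstract.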

Bibliographic details

Source: 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics, 2003, No. 251, 6 pages
Authors: Masato NAKAYAMA; Takanobu NISHIURA; Hideki KAWAHARA
Original format: PDF
Text language: Japanese (jpn)
Chinese Library Classification: Acoustic engineering
Keywords: Microphone-array; Adaptive filter; Vowel/consonant identification; Noisy speech recognition
