首页> 外文期刊>電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics >Adaptive beamformer with vowel/consonant identification based on average vowel/consonant spectrum for nosiy speech recognition
【24h】

Adaptive beamformer with vowel/consonant identification based on average vowel/consonant spectrum for nosiy speech recognition

机译:基于平均元音/辅音谱的自适应波束形成器,用于嘈杂语音识别的平均元音/辅音谱

获取原文
获取原文并翻译 | 示例
           

摘要

A microphone-array is an ideal candidate for capturing distant-talking speech. The AMNOR (Adaptive Microphone-array for NOise Reduction) is an adaptive beamformer proposed by Kaneda, et. al. In addition, as the beamformer for speech capture, S-AMNOR, the AMNOR with a long time speech spectrum was also proposed by Okada, et. al. In this paper, we propose the new AMNOR with adaptive filters for vowels/consonants, in order to improve the signal capturing performance. In addition, we automatically identify the vowels and consonants from output signal of the AMNOR by using a GMM (Gaussian Mixture Model). The performance of the proposed system is evaluated through experiments with a microphone array in a real room. As a result of evaluation experiments, the ASR (Automatic Speech Recognition) performance of proposed method achieved 65% in SNR = 10dB (speech comes from 90 deg. and noise comes from 50 deg.) environment, although the ASR performance of the conventional method is 60%.
机译:麦克风阵列是捕获遥远谈话言论的理想候选者。 AMNOR(用于降噪的自适应麦克风阵列)是Kaneda等的自适应波束形成器。 al。 此外,作为语音捕获的波束形成器,S-AMNOR,还通过冈田等提出了具有长时间语音谱的AMNOR。 al。 在本文中,我们向元音/辅音的自适应滤波器提出了新的AMNor,以改善信号捕获性能。 此外,我们通过使用GMM(高斯混合模型)自动识别来自AMNOR的输出信号的元音和辅音。 通过实际房间中的麦克风阵列进行实验,评估所提出的系统的性能。 由于评估实验,所提出的方法的ASR(自动语音识别)性能在SNR = 10dB中实现了65%(语音来自90°。和噪声来自50°)环境,虽然是传统方法的ASR性能 是60%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号