首页> 外文会议>2002 China-Japan Joint Conference on Acoustics Nov 14-17, 2002 Nanjing, China >SIGNAL ENHANCEMENT USING FREQUENCY DOMAIN BINAURAL MODEL FOR HUMANOID ROBOT
【24h】

SIGNAL ENHANCEMENT USING FREQUENCY DOMAIN BINAURAL MODEL FOR HUMANOID ROBOT

机译:基于频域双模型的类人机器人信号增强

获取原文
获取原文并翻译 | 示例

摘要

As well known, a "cock-tail party effect" is a binaural effect to emphasize a degraded signal by surrounding noise. Psychoacoustical models of binaural effect have been studied. This paper proposes a frequency domain binaural model using FFT base filter-bank to reduce the computational load. This model is based on both interaural level difference(ILD) and interaural time difference(ITD) of binaural inputs, and weighting balances of ILD and ITD are tuned depending on frequency. The automatic speeh recognition system equipped with this binaural model can reach over 90% correct recognition when direction of noise source is more than 15 degree apart and when SNR=+5dB. Also intelligibility of enhanced speech is quit hight.
机译:众所周知,“鸡尾酒会效应”是双耳效应,以通过周围的噪声来强调劣化的信号。研究了双耳效应的心理声学模型。本文提出了一种基于FFT基滤波器组的频域双耳模型,以减少计算量。该模型基于双耳输入的耳间水平差(ILD)和耳间时间差(ITD),并且根据频率调整ILD和ITD的权重平衡。当噪声源的方向相隔15度以上且SNR = + 5dB时,配备有该双耳模型的自动语音识别系统可以达到90%以上的正确识别。也增强了语音的清晰度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号