首页> 外文会议>European Conference on Speech Communication and Technology >SPEECH SEGREGATION BASED ON FUNDAMENTAL EVENT INFORMATION USING AN AUDITORY VOCODER
【24h】

SPEECH SEGREGATION BASED ON FUNDAMENTAL EVENT INFORMATION USING AN AUDITORY VOCODER

机译:基于使用听觉声码器的基本事件信息的语音分离

获取原文

摘要

We present a new auditory method to segregate concurrent speech sounds. The system is based on an auditory vocoder developed to resynthesize speech from an auditory Mellin representation using the vocoder STRAIGHT. The auditory representation preserves fine temporal information, unlike conventional window-based processing, and this makes it possible to segregate speech sources with an event synchronous procedure. We developed a method to convert fundamental frequency information to estimate glottal pulse times so as to facilitate robust extraction of the target speech. The results show that the segregation is good even when the SNR is 0 dB; the extracted target speech was a little distorted but entirely intelligible, whereas the distracter speech was reduced to a non-speech sound that was not perceptually disturbing. So, this auditory vocoder has potential for speech enhancement in applications such as hearing aids.
机译:我们提出了一种新的听觉方法来分离并发语音声音。该系统基于经过视听声码器,该声码器开发用于使用声码器直线从听觉MELLIN表示中重新合成语音。与传统的基于窗口的处理不同,听觉表示保留了精细的时间信息,并且这使得可以通过事件同步过程分离语音源。我们开发了一种将基本频率信息转换为估计光泽脉冲次数的方法,以便于鲁棒提取目标语音。结果表明,即使SNR为0 dB,偏析也很好;提取的目标语言有点扭曲但完全可理解,而干扰言论被降低到没有感知令人扰乱的非语音声音。因此,这种听觉声码器具有助听器等应用中的语音增强潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号