Journal: Applied Acoustics

Real-time processing using the frequency domain binaural model


Abstract

There are many approaches to achieving high-performance speech enhancement. Modeling the human auditory system is a promising one, since humans can focus on target speech in the presence of concurrent speech. One example of a binaural model is the time domain binaural model. However, this model has a high computational cost because the algorithm is based on auto-correlation, which is computationally intensive. Another example is the frequency domain binaural model proposed by Nakashima et al. [Nakashima H, Chisaki Y, Usagawa T, Ebata M. Frequency domain binaural model based on interaural phase and level differences. Acoust Sci Technol 2003;24(4):172-8]. Since the frequency domain binaural model uses the fast Fourier transform, its computational cost is much lower than that of the time domain binaural model. Therefore, real-time processing is feasible on recent hardware such as digital signal processors and even laptop personal computers. However, the quality of the segregated sound obtained using the frequency domain binaural model depends on system parameters such as the frequency resolution and the frame shift length used for overlap-add in the time domain. This paper describes the construction of a prototype hearing assistant system based on the frequency domain binaural model. The implementation techniques and parameter tuning are described in detail. The proposed system runs in real time after parameter tuning. The directional attenuation levels, that is, the directivity patterns of the proposed system, are measured. Finally, it is shown that the prototype can extract sounds coming from specific directions in real time.
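The role of the frame length (frequency resolution) and the frame shift in overlap-add processing can be illustrated with a minimal sketch. It is not the algorithm of Nakashima et al. or of the prototype described in the abstract. It assumes a simple binary mask that keeps a frequency bin only when its interaural phase difference (IPD) and interaural level difference (ILD) are both close to zero, that is, when the sound appears to arrive from the front. The function `segregate`, the parameters `frame_len`, `hop`, `max_ipd` and `max_ild_db`, and their default values are all hypothetical, and the small pure-Python FFT stands in for an optimized library routine.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def ifft(spec):
    """Inverse FFT via the conjugation identity."""
    n = len(spec)
    y = fft([v.conjugate() for v in spec])
    return [v.conjugate() / n for v in y]


def segregate(left, right, frame_len=64, hop=32, max_ipd=0.3, max_ild_db=3.0):
    """Keep only bins whose IPD and ILD are near zero (assumed frontal source).

    Assumption: fixed, frequency-independent thresholds. A real system
    would tune them per frequency band and per target direction.
    """
    # Periodic Hann window: with hop = frame_len / 2 the shifted windows
    # sum to one, so overlap-add reconstructs an unmasked signal exactly.
    win = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_len)
           for i in range(frame_len)]
    out = [0.0] * len(left)
    for start in range(0, len(left) - frame_len + 1, hop):
        spec_l = fft([left[start + i] * win[i] for i in range(frame_len)])
        spec_r = fft([right[start + i] * win[i] for i in range(frame_len)])
        masked = []
        for l, r in zip(spec_l, spec_r):
            ipd = cmath.phase(l * r.conjugate())  # radians
            ild = 20 * math.log10((abs(l) + 1e-12) / (abs(r) + 1e-12))  # dB
            keep = abs(ipd) <= max_ipd and abs(ild) <= max_ild_db
            masked.append(0.5 * (l + r) if keep else 0j)
        frame = ifft(masked)
        for i in range(frame_len):
            out[start + i] += frame[i].real  # overlap-add in the time domain
    return out
```

In this sketch `frame_len` sets the frequency resolution (sampling rate divided by `frame_len`), while `hop` is the frame shift. A smaller `hop` means more FFTs per second of audio, which is the kind of trade-off between sound quality and computational load that parameter tuning for real-time operation has to settle.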
