Source: Pacific-Rim Conference on Multimedia

A Sound Image Reproduction Model Based on Personalized Weight Vectors


Abstract

Many perceptual models for audio reconstruction have been proposed to create virtual sound, but the direction of the virtual sound may deviate from the desired direction because the binaural cues are distorted. In this paper, an equation relating the binaural cues of a real sound to those of a virtual sound reproduced by two loudspeakers is established, and weight vectors are derived from it based on the head-related transfer function (HRTF). After being filtered by the weight vectors, the sound signals emitted from the loudspeakers can deliver an accurate spatial impression to the listener. However, HRTFs differ from listener to listener, so the calculated weight vectors also vary from person to person. Therefore, a radial basis function neural network (RBFNN) is designed to personalize the weight vectors for each specific listener. Compared with three existing methods, namely vector base amplitude panning (VBAP), HRTF-based panning (HP), and band-based panning (BP), the proposed method reproduces binaural cues more accurately, and a subjective test also indicates that there is no significant difference in perception between the real sound and the virtual sound produced by the proposed method.
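The abstract does not give the binaural-cue equation itself, so the following is only a minimal sketch of one common way such weights could be obtained, not the authors' formulation. It assumes that, at a single frequency bin, the two loudspeaker weights are chosen so that the weighted sum of the loudspeaker HRTFs equals the HRTF of the desired direction at both ears. The function names and all numeric HRTF values are hypothetical.

```python
import math


def solve_weights(h1, h2, target):
    """Solve for two complex loudspeaker weights at one frequency bin.

    h1, h2  : (left_ear, right_ear) complex HRTFs of loudspeakers 1 and 2
    target  : (left_ear, right_ear) complex HRTF of the desired direction
    Returns (w1, w2) with w1*h1 + w2*h2 == target at both ears.
    This matching condition is an assumption, not the paper's equation.
    """
    # 2x2 linear system solved with Cramer's rule.
    det = h1[0] * h2[1] - h2[0] * h1[1]
    if abs(det) < 1e-12:
        raise ValueError("loudspeaker HRTFs are nearly linearly dependent")
    w1 = (target[0] * h2[1] - h2[0] * target[1]) / det
    w2 = (h1[0] * target[1] - target[0] * h1[1]) / det
    return w1, w2


def ild_db(ears):
    """Interaural level difference in dB for a (left, right) complex pair."""
    return 20.0 * math.log10(abs(ears[0]) / abs(ears[1]))


# Hypothetical HRTF values at one frequency bin (illustration only).
h_spk1 = (1.0 + 0.2j, 0.5 - 0.1j)
h_spk2 = (0.5 - 0.1j, 1.0 + 0.2j)
h_target = (0.9 + 0.1j, 0.7 + 0.0j)

w1, w2 = solve_weights(h_spk1, h_spk2, h_target)
# Ear signals produced by the weighted loudspeaker pair.
reproduced = (w1 * h_spk1[0] + w2 * h_spk2[0],
              w1 * h_spk1[1] + w2 * h_spk2[1])
```

Under this assumption the reproduced ear signals match the target, so level-difference cues computed from them agree as well. Since HRTFs differ between listeners, the same solve applied to another listener's HRTFs would generally give different weights, which is the motivation the abstract gives for personalizing them.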
