首页> 外文OA文献 >An investigation into the real-time manipulation and control of three-dimensional sound fields
【2h】

An investigation into the real-time manipulation and control of three-dimensional sound fields

机译:三维声场的实时操纵与控制研究

摘要

This thesis describes a system that can be used for the decoding of a three dimensional audio recording over headphones or two, or more, speakers. A literature review of psychoacoustics and a review (both historical and current) of surround sound systems is carried out. The need for a system which is platform independent is discussed, and the proposal for a system based on an amalgamation of Ambisonics, binaural and transaural reproduction schemes is given. In order for this system to function optimally, each of the three systems rely on providing the listener with the relevant psychoacoustic cues. The conversion from a five speaker ITU array to binaural decode is well documented but pair-wise panning algorithms will not produce the correct lateralisation parameters at the ears of a centrally seated listener. Although Ambisonics has been well researched, no one has, as yet, produced a psychoacoustically optimised decoder for the standard irregular five speaker array as specified by the ITU as the original theory, as proposed by Gerzon and Barton (1992) was produced (known as a Vienna decoder), and example solutions given, before the standard had been decided on. In this work, the original work by Gerzon and Barton (1992) is analysed, and shown to be suboptimal, showing a high/low frequency decoder mismatch due to the method of solving the set of non-linear simultaneous equations. A method, based on the Tabu search algorithm, is applied to the Vienna decoder problem and is shown to provide superior results to those shown by Gerzon and Barton (1992) and is capable of producing multiple solutions to the Vienna decoder problem. During the write up of this report Craven (2003) has shown how 4th order circular harmonics (as used in Ambisonics) can be used to create a frequency independent panning law for the five speaker ITU array, and this report also shows how the Tabu search algorithm can be used to optimise these decoders further. A new method is then demonstrated using the Tabu search algorithm coupled with lateralisation parameters extracted from a binaural simulation of the Ambisonic system to be optimised (as these are the parameters that the Vienna system is approximating). This method can then be altered to take into account head rotations directly which have been shown as an important psychoacoustic parameter in the localisation of a sound source (Spikofski et al., 2001) and is also shown to be useful in differentiating between decoders optimised using the Tabu search form of the Vienna optimisations as no objective measure had been suggested. Optimisations for both Binaural and Transaural reproductions are then discussed so as to maximise the performance of generic HRTF data (i.e. not individualised) using inverse filtering methods, and a technique is shown that minimises the amount of frequency dependant regularisation needed when calculating cross-talk cancellation filters.
机译:本文介绍了一种可用于解码耳机或两个或多个扬声器上的三维音频记录的系统。进行了关于心理声学的文献综述和对环绕声系统的回顾(历史和当前的回顾)。讨论了对与平台无关的系统的需求,并提出了基于混合声,双耳和双耳再现方案的系统的建议。为了使该系统发挥最佳功能,这三个系统中的每一个都依赖于为收听者提供相关的心理声学提示。从5个扬声器的ITU阵列到双耳解码的转换已有详细记录,但成对的平移算法将不会在居中位置的听众的耳朵产生正确的侧向参数。尽管对Ambisonics进行了充分的研究,但迄今尚无人为Geron和Barton(1992)提出的,针对ITU指定为原始理论的标准不规则五扬声器阵列生产出一种心理声学优化的解码器。维也纳解码器),以及在确定标准之前给出的示例解决方案。在这项工作中,分析了Gerzon和Barton(1992)的原始工作,结果显示是次优的,由于解决了非线性联立方程组的方法,显示了高/低频解码器不匹配。一种基于禁忌搜索算法的方法应用于维也纳解码器问题,并显示出比Gerzon和Barton(1992)所示的结果更好的结果,并且能够为维也纳解码器问题提供多种解决方案。在撰写此报告期间,Craven(2003)展示了如何使用四阶圆形谐波(在Ambisonics中使用)为五个扬声器ITU阵列创建频率独立的声相定律,并且该报告还显示了禁忌搜索可以使用算法进一步优化这些解码器。然后,使用禁忌搜索算法和从待优化的Ambisonic系统的双耳模拟中提取的侧向化参数(因为这些是Vienna系统近似的参数),展示了一种新方法。然后可以更改此方法以直接考虑头部旋转,头部旋转已显示为声源定位中的重要心理声学参数(Spikofski等人,2001年),并且还显示出在区分使用维也纳优化的禁忌搜索表,因为没有建议采取客观措施。然后讨论了双耳和双耳复制的优化,以便使用逆滤波方法最大化通用HRTF数据的性能(即未个性化),并且显示了一种技术,该技术可最大程度地减少计算串扰消除时所需的频率相关正则化量过滤器。

著录项

  • 作者

    Wiggins Bruce;

  • 作者单位
  • 年度 2004
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号