Acoustic Reflector Localisation for Blind Source Separation and Spatial Audio

Abstract

From a physical point of view, sound is classically described by wave functions. Like any other wave phenomenon, sound interacts with the obstacles it encounters as it propagates. These interactions produce reflections of the main signal, which can be either supportive or interfering. In signal processing research it is therefore important to identify these reflections, in order to exploit the former and avoid the latter.

The main contribution of this thesis concerns acoustic reflector localisation. Four novel methods are proposed: a method that localises the image source before finding the reflector position; two variants of this method, which utilise information from multiple loudspeakers; and a method that localises the reflector directly, without any pre-processing. A comparative evaluation of different acoustic reflector localisation methods is then conducted on both simulated and measured data. The results show that the last proposed method outperforms the state of the art. The second contribution applies the acoustic reflector localisation solution to spatial audio, with the main objective of giving listeners the sensation of being in the recorded environment. A novel way of encoding and decoding room acoustic information is proposed, in which sounds are parametrised and defined as reverberant spatial audio objects (RSAOs). A set of subjective assessments is performed. The results demonstrate both the high quality of the sound produced by the proposed parametrisation and the reliability of manually modifying the acoustics of recorded environments. The third contribution lies in the field of speech source separation. A modified version of a state-of-the-art method is presented, in which direct sound and first reflection information is used to model binaural cues. Experiments were performed to separate speech sources in different environments. The results show that the new method outperforms the state of the art when one interferer is present in the recordings.

The simulation and experimental results presented in this thesis represent a significant addition to the literature and will influence future choices of acoustic reflector localisation systems, 3D rendering, and source separation techniques. Future work may focus on fusing acoustic and visual cues to enhance acoustic scene analysis.
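
The abstract does not give the details of the proposed methods. As an illustration of the general idea behind localising an image source before finding the reflector, the sketch below uses only the standard image source model: a planar reflector mirrors the source onto its image, so the reflector is the plane through the midpoint of the two points, perpendicular to the line joining them. The function names and the example coordinates are hypothetical, and this is not the thesis's algorithm, which must also estimate the image source position from measured signals.

```python
import math


def reflector_from_image_source(source, image_source):
    """Return (unit_normal, offset) of the plane n . x = offset that
    mirrors `source` onto `image_source` (standard image source model)."""
    diff = [b - a for a, b in zip(source, image_source)]
    dist = math.sqrt(sum(c * c for c in diff))
    if dist == 0.0:
        raise ValueError("source and image source coincide; no unique reflector")
    # The plane normal points from the source towards its image.
    normal = [c / dist for c in diff]
    # The plane passes through the midpoint of the two points.
    midpoint = [(a + b) / 2.0 for a, b in zip(source, image_source)]
    offset = sum(n * m for n, m in zip(normal, midpoint))
    return normal, offset


def mirror_point(point, normal, offset):
    """Reflect `point` across the plane n . x = offset."""
    signed_dist = sum(n * p for n, p in zip(normal, point)) - offset
    return [p - 2.0 * signed_dist * n for p, n in zip(point, normal)]


# Hypothetical example: a source 1 m in front of a wall lying in the plane x = 0.
source = [1.0, 2.0, 1.5]
image = [-1.0, 2.0, 1.5]
normal, offset = reflector_from_image_source(source, image)
```

Mirroring the source across the recovered plane with `mirror_point(source, normal, offset)` returns the image source again, which is a quick consistency check on the geometry.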

Bibliographic record

  • Author: Remaggi, Luca
  • Author affiliation: University of Surrey (United Kingdom)
  • Degree-granting institution: University of Surrey (United Kingdom)
  • Subject: Acoustics; Electrical engineering
  • Degree: Ph.D.
  • Year: 2017
  • Pages: 212
  • Original format: PDF
  • Language of text: English
