
A psychoacoustic engineering approach to machine sound source separation in reverberant environments


Abstract

Reverberation continues to present a major problem for sound source separation algorithms, due to its corruption of many of the acoustical cues on which these algorithms rely. However, humans demonstrate a remarkable robustness to reverberation and many psychophysical and perceptual mechanisms are well documented. This thesis therefore considers the research question: can the reverberation–performance of existing psychoacoustic engineering approaches to machine source separation be improved? The precedence effect is a perceptual mechanism that aids our ability to localise sounds in reverberant environments. Despite this, relatively little work has been done on incorporating the precedence effect into automated sound source separation. Consequently, a study was conducted that compared several computational precedence models and their impact on the performance of a baseline separation algorithm. The algorithm included a precedence model, which was replaced with the other precedence models during the investigation. The models were tested using a novel metric in a range of reverberant rooms and with a range of other mixture parameters. The metric, termed Ideal Binary Mask Ratio, is shown to be robust to the effects of reverberation and facilitates meaningful and direct comparison between algorithms across different acoustic conditions. Large differences between the performances of the models were observed. The results showed that a separation algorithm incorporating a model based on interaural coherence produces the greatest performance gain over the baseline algorithm. The results from the study also indicated that it may be necessary to adapt the precedence model to the acoustic conditions in which the model is utilised. This effect is analogous to the perceptual Clifton effect, which is a dynamic component of the precedence effect that appears to adapt precedence to a given acoustic environment in order to maximise its effectiveness. 
However, no work has been carried out on adapting a precedence model to the acoustic conditions under test. Specifically, although the necessity for such a component has been suggested in the literature, neither its necessity nor benefit has been formally validated. Consequently, a further study was conducted in which parameters of each of the previously compared precedence models were varied in each room in order to identify if, and to what extent, the separation performance varied with these parameters. The results showed that the reverberation–performance of existing psychoacoustic engineering approaches to machine source separation can be improved and can yield significant gains in separation performance.
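The abstract names the Ideal Binary Mask Ratio but does not define it, so the sketch below does not attempt to reproduce that metric. It illustrates only the standard ideal binary mask that such metrics build on: a time-frequency unit is set to 1 when the target energy exceeds the interferer energy by more than a threshold. The function names, the 0 dB default threshold, and the simple agreement fraction are assumptions made for illustration, not the thesis's definitions.

```python
def ideal_binary_mask(target_energy, interferer_energy, threshold_db=0.0):
    """Return 1 where the target-to-interferer ratio exceeds threshold_db, else 0.

    Inputs are equally shaped lists of rows (frequency channels by time frames)
    holding non-negative energies. The 0 dB default is an assumption.
    """
    ratio = 10.0 ** (threshold_db / 10.0)
    return [
        [1 if t > ratio * i else 0 for t, i in zip(t_row, i_row)]
        for t_row, i_row in zip(target_energy, interferer_energy)
    ]


def mask_agreement(estimated_mask, ideal_mask):
    """Fraction of time-frequency units where the two masks agree.

    Hypothetical helper for illustration; this is NOT the Ideal Binary
    Mask Ratio described in the abstract.
    """
    pairs = [
        (e, m)
        for e_row, m_row in zip(estimated_mask, ideal_mask)
        for e, m in zip(e_row, m_row)
    ]
    if not pairs:
        raise ValueError("masks must contain at least one time-frequency unit")
    return sum(1 for e, m in pairs if e == m) / len(pairs)


# Toy energies: 2 frequency channels by 2 time frames.
target = [[4.0, 1.0], [0.5, 9.0]]
interferer = [[1.0, 2.0], [2.0, 3.0]]
ibm = ideal_binary_mask(target, interferer)
```

An estimated mask produced by a separation algorithm could then be compared against `ibm` with `mask_agreement`. A plain agreement fraction like this one weights every unit equally regardless of its energy, which is one reason a purpose-built metric may be preferred.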

Bibliographic details

  • Author

    Hummersone C;

  • Year 2011
  • Original format PDF
  • Language English
