首页> 外文OA文献 >A psychoacoustic engineering approach to machine sound source separation in reverberant environments

【2h】

A psychoacoustic engineering approach to machine sound source separation in reverberant environments

机译：一种用于混响环境中机器声源分离的心理声学工程方法

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reverberation continues to present a major problem for sound source separation algorithms, due to its corruption of many of the acoustical cues on which these algorithms rely. However, humans demonstrate a remarkable robustness to reverberation and many psychophysical and perceptual mechanisms are well documented. This thesis therefore considers the research question: can the reverberation–performance of existing psychoacoustic engineering approaches to machine source separation be improved? The precedence effect is a perceptual mechanism that aids our ability to localise sounds in reverberant environments. Despite this, relatively little work has been done on incorporating the precedence effect into automated sound source separation. Consequently, a study was conducted that compared several computational precedence models and their impact on the performance of a baseline separation algorithm. The algorithm included a precedence model, which was replaced with the other precedence models during the investigation. The models were tested using a novel metric in a range of reverberant rooms and with a range of other mixture parameters. The metric, termed Ideal Binary Mask Ratio, is shown to be robust to the effects of reverberation and facilitates meaningful and direct comparison between algorithms across different acoustic conditions. Large differences between the performances of the models were observed. The results showed that a separation algorithm incorporating a model based on interaural coherence produces the greatest performance gain over the baseline algorithm. The results from the study also indicated that it may be necessary to adapt the precedence model to the acoustic conditions in which the model is utilised. This effect is analogous to the perceptual Clifton effect, which is a dynamic component of the precedence effect that appears to adapt precedence to a given acoustic environment in order to maximise its effectiveness. However, no work has been carried out on adapting a precedence model to the acoustic conditions under test. Specifically, although the necessity for such a component has been suggested in the literature, neither its necessity nor benefit has been formally validated. Consequently, a further study was conducted in which parameters of each of the previously compared precedence models were varied in each room in order to identify if, and to what extent, the separation performance varied with these parameters. The results showed that the reverberation–performance of existing psychoacoustic engineering approaches to machine source separation can be improved and can yield significant gains in separation performance.

机译：由于混响破坏了这些算法所依赖的许多声音提示，因此混响仍然是声源分离算法的主要问题。然而，人类对混响表现出非凡的鲁棒性，并且许多心理物理和知觉机制都有据可查。因此，本论文考虑了以下研究问题：是否可以改善现有的心理声学工程方法在机器源分离中的混响效果？优先效果是一种感知机制，有助于我们在混响环境中定位声音。尽管如此，在将优先效果纳入自动声源分离中的工作还很少。因此，进行了一项研究，比较了几种计算优先级模型及其对基线分离算法性能的影响。该算法包括一个优先级模型，该模型在调查过程中被其他优先级模型替换。在一系列混响室和一系列其他混合参数下，使用新颖的度量标准对模型进行了测试。该度量标准被称为理想二进制掩码比率，显示出对混响效果的鲁棒性，并有助于在不同声学条件下的算法之间进行有意义且直接的比较。观察到模型之间的性能差异很大。结果表明，结合基于听觉相干性模型的分离算法比基线算法产生最大的性能提升。研究的结果还表明，可能有必要使优先模型适应使用该模型的声学条件。此效果类似于感知的克利夫顿效应，它是优先级效果的动态组成部分，该效果似乎使优先级适应给定的声学环境，从而最大程度地发挥其效果。但是，尚未进行使优先模型适应被测声学条件的工作。具体地，尽管在文献中已经提出了对于这种成分的必要性，但是其必要性和益处均未得到正式验证。因此，进行了进一步的研究，其中在每个房间中改变每个先前比较的优先模型的参数，以便确定分离性能是否以及在何种程度上随这些参数而变化。结果表明，现有的心理声学工程方法在机器源分离中的混响性能可以得到改善，并且可以显着提高分离性能。

著录项

作者
Hummersone C;
展开▼
作者单位

展开▼
年度 2011
总页数
原文格式 PDF
正文语种 English
中图分类

相似文献

外文文献
中文文献
专利

1. Robust sound source separation in a reverberant environment based on harmonic structure and sound source direction [J] . Takashi Yoshida, Tomohiro Nakatani, Hiroshi G. Okuno, 電子情報通信学会技術研究報告. 音声. Speech . 2003,第27期

机译：基于谐波结构和声源方向的混响环境中的稳健声源分离
2. Robust sound source separation in a reverberant environment based on harmonic structure and sound source direction [J] . Takashi Yoshida, Tomohiro Nakatani, Hiroshi G. Okuno, 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2003,第25期

机译：基于谐波结构和声源方向的混响环境中的稳健声源分离
3. Robust sound source separation in a reverberant environment based on harmonic structure and sound source direction [J] . Takashi Yoshida, Tomohiro Nakatani, Hiroshi G. Okuno, 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2003,第25期

机译：基于谐波结构和声源方向的混响环境中强大的声音源分离
4. Real-time source separation based on sound localization in a reverberant environment [C] . Aoki, M., Furuya, . 2002

机译：混响环境中基于声音定位的实时音源分离
5. Sound Source Localization in Complex Indoor Environment: A Self-supervised Incremental Learning Approach [D] . Zhang, Zeyu 2019

机译：复杂室内环境中的声源定位：一种自我监督的增量学习方法
6. Learning to localize sounds in a highly reverberant environment: Machine-learning tracking of dolphin whistle-like sounds in a pool [O] . Sean F. Woodward, Diana Reiss, Marcelo O. Magnasco, 2020

机译：学习在高音环境中定位声音：在游泳池中的海豚哨子的声音的机器学习跟踪
7. Informed algorithms for sound source separation in enclosed reverberant environments [O] . Khan Muhammad Salman 2013

机译：封闭式混响环境中声源分离的明智算法

A psychoacoustic engineering approach to machine sound source separation in reverberant environments

摘要

著录项

相似文献

相关主题

期刊订阅