首页> 外文OA文献 >Studies on binaural and monaural signal analysis methods and applications
【2h】

Studies on binaural and monaural signal analysis methods and applications

机译:双耳和单耳信号分析方法及应用研究

摘要

Sound signals can contain a lot of information about the environment and the sound sources present in it. This thesis presents novel contributions to the analysis of binaural and monaural sound signals. Some new applications are introduced in this work, but the emphasis is on analysis methods. The three main topics of the thesis are computational estimation of sound source distance, analysis of binaural room impulse responses, and applications intended for augmented reality audio. A novel method for binaural sound source distance estimation is proposed. The method is based on learning the coherence between the sounds entering the left and right ears. Comparisons to an earlier approach are also made. It is shown that these kinds of learning methods can correctly recognize the distance of a speech sound source in most cases. Methods for analyzing binaural room impulse responses are investigated. These methods are able to locate the early reflections in time and also to estimate their directions of arrival. This challenging problem could not be tackled completely, but this part of the work is an important step towards accurate estimation of the individual early reflections from a binaural room impulse response. As the third part of the thesis, applications of sound signal analysis are studied. The most notable contributions are a novel eyes-free user interface controlled by finger snaps, and an investigation on the importance of features in audio surveillance. The results of this thesis are steps towards building machines that can obtain information on the surrounding environment based on sound. In particular, the research into sound source distance estimation functions as important basic research in this area. The applications presented could be valuable in future telecommunications scenarios, such as augmented reality audio.
机译:声音信号可能包含许多有关环境和其中存在的声源的信息。本文为双耳和单耳声信号的分析提出了新的贡献。这项工作中介绍了一些新的应用程序,但重点是分析方法。本文的三个主要主题是声源距离的计算估计,双耳室冲激响应分析以及旨在用于增强现实音频的应用。提出了一种双耳声源距离估计的新方法。该方法基于学习进入左耳和右耳的声音之间的连贯性。还与早期方法进行了比较。结果表明,在大多数情况下,这些学习方法可以正确识别语音声源的距离。研究了分析双耳房间冲激响应的方法。这些方法能够及时定位早期反射并估计其到达方向。这个具有挑战性的问题无法完全解决,但是这部分工作是迈向从双耳室冲动响应准确估计各个早期反射的重要一步。作为论文的第三部分,研究了声音信号分析的应用。最显着的贡献是通过手指弹跳控制的新颖的免眼用户界面,以及对音频监控功能的重要性的研究。本文的结果是迈向构建可以基于声音获取周围环境信息的机器的步骤。特别地,声源距离估计的研究作为该领域的重要基础研究。提出的应用程序在增强现实音频等未来电信场景中可能会很有价值。

著录项

  • 作者

    Vesa Sampo;

  • 作者单位
  • 年度 2009
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号