首页> 外文学位 >Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming.
【24h】

Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming.

机译:通过计算听觉场景分析(CASA)增强的波束形成进行声源分离。

获取原文
获取原文并翻译 | 示例

摘要

In this work, techniques are developed and studied for the extraction of single-source acoustic signals out of multi-source mixtures. Such extracted signals can be used in a variety of applications including: automatic speech recognition, digital hearing aids, teleconferencing, and robot auditory systems. Most previous approaches fall into two categories: computational auditory scene analysis (CASA) and array signal processing.; The approach taken here is to combine these complementary techniques into an integrated one: CASA-enhanced beamforming. This integrated approach has the advantage of combining the array processing location attribute (direction of propagation through a sound-field) with the monaural CASA source attributes (fundamental frequency, on/offset, etc.). The motivation for the CASA-enhanced beamforming approach is the recognition that, by combining the statistically independent location and source attributes, more mixtures can be separated. A mixture that could not be separated by the location attribute alone (for example, if the single-source signals in the mixture have the same location attribute value) may be separated using source attributes, and vice versa.; An alternative to beamforming is binaural CASA. Beamforming is chosen for our integrated approach because it has the following advantages: (1) Since binaural CASA evolved to operate under the constraints of the human auditory system (with only two ears and spectral shaping due to the shape of the human body), it is not clear that it is an ideal method for a computer implementation. Beamforming is more flexible. It allows for any array geometry (number and arrangement of sensors). (2) The beamforming approach is mathematically derived based on a physical model of the acoustic wavefield. So, its processing effect is well-understood. (3) Beamforming operates via an analytic expression. So, its performance can be quantified (as a function of array geometry and the frequency content of the signals in the wavefield).; Experimental results show that CASA-enhanced beamforming extracts wideband signal estimates with higher signal-to-interference ratios (SIR) than monaural CASA, or beamforming alone. That is, it generates wideband signal estimates with the most interference rejection. Regarding intellibility, beamforming produces the lowest spectral distortion. However, CASA-enhanced beamforming's spectral distortion is shown to be comparable to monaural CASA's, and better than binaural CASA's.
机译:在这项工作中,开发和研究了从多源混合物中提取单源声信号的技术。这样提取的信号可以用于各种应用程序,包括:自动语音识别,数字助听器,电话会议和机器人听觉系统。先前的大多数方法分为两类:计算听觉场景分析(CASA)和阵列信号处理。此处采用的方法是将这些互补技术组合为一种集成技术:CASA增强波束成形。这种集成方法的优点是将阵列处理位置属性(通过声场传播的方向)与单声道CASA源属性(基本频率,开/关等)组合在一起。采用CASA增强波束成形方法的动机是认识到,通过组合统计独立的位置和源属性,可以分离更多的混合。不能单独由位置属性分隔的混合物(例如,如果混合物中的单源信号具有相同的位置属性值)可以使用源属性分隔,反之亦然。波束成形的替代方法是双耳CASA。选择波束成形是我们的综合方法,因为它具有以下优点:(1)自从双耳CASA演变为在人类听觉系统的约束下运行(由于人体的形状只有两只耳朵和频谱成形),尚不清楚这是否是计算机实现的理想方法。波束成形更加灵活。它允许任何阵列几何形状(传感器的数量和布置)。 (2)波束成形方法是基于声波场的物理模型从数学上推导的。因此,其加工效果是众所周知的。 (3)波束成形通过解析表达式进行操作。因此,它的性能可以量化(作为阵列几何形状和波场中信号的频率含量的函数)。实验结果表明,与单声道CASA或单独使用波束成形相比,增强了CASA的波束成形提取的宽带信号估计具有更高的信噪比(SIR)。也就是说,它产生干扰抑制最大的宽带信号估计。关于智能,波束成形产生的频谱失真最低。但是,已显示出CASA增强的波束成形的频谱失真与单声道CASA相当,并且比双耳CASA更好。

著录项

  • 作者

    Drake, Laura Ann.;

  • 作者单位

    Northwestern University.;

  • 授予单位 Northwestern University.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 284 p.
  • 总页数 284
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号