MULTI-MICROPHONE METHOD FOR ESTIMATION OF TARGET AND NOISE SPECTRAL VARIANCES FOR SPEECH DEGRADED BY REVERBERATION AND OPTIONALLY ADDITIVE NOISE

Abstract

The application relates to an audio processing system and to a method of processing a noisy (e.g. reverberant) signal that comprises a first noise signal component (v), optionally a second noise signal component (w), and a target signal component (x). The method comprises: a) providing or receiving a time-frequency representation Y_i(k,m) of a noisy audio signal y_i at an i-th input unit, i = 1, 2, ..., M, where M ≥ 2; b) providing (e.g. predefined spatial) characteristics of the target signal component and the noise signal component(s); and c) estimating spectral variances, or scaled versions thereof, λ_V and λ_X of the first noise signal component v (representing reverberation) and the target signal component x, respectively. The estimates of λ_V and λ_X are jointly optimal in the maximum-likelihood sense under the statistical assumptions that (i) the time-frequency representations Y_i(k,m), X_i(k,m), and V_i(k,m) (and W_i(k,m)) of the signals y_i(n) and of the signal components x_i and v_i (and w_i) are zero-mean, complex-valued Gaussian distributed; (ii) each of them is statistically independent across time m and frequency k; and (iii) X_i(k,m) and V_i(k,m) (and W_i(k,m)) are uncorrelated. An advantage of the invention is that it provides a basis for improved intelligibility of an input speech signal. The invention may be used, for example, in hearing assistance devices such as hearing aids.
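
To make the estimation step more concrete, the following is a minimal Python sketch for the two-microphone case (M = 2) at a single time-frequency bin, ignoring the optional additive noise w. The abstract does not give the estimator itself, so everything here is an assumption made for illustration: the covariance model C_Y = λ_X d d^H + λ_V Γ, a known target steering vector `d`, a known reverberation coherence matrix `gamma`, and the closed-form solution (a target-blocking vector for λ_V, then MVDR weights for λ_X). The function and variable names are hypothetical, and the clamping to zero is a practical safeguard rather than part of the described method.

```python
def quad_form(a, mat, b):
    """Return a^H * mat * b for 2-element vectors and a 2x2 matrix."""
    return sum(a[i].conjugate() * mat[i][j] * b[j]
               for i in range(2) for j in range(2))


def solve_2x2(mat, v):
    """Solve mat * x = v for a 2x2 complex matrix using Cramer's rule."""
    det = mat[0][0] * mat[1][1] - mat[0][1] * mat[1][0]
    return [(mat[1][1] * v[0] - mat[0][1] * v[1]) / det,
            (mat[0][0] * v[1] - mat[1][0] * v[0]) / det]


def model_covariance(lam_x, lam_v, d, gamma):
    """Assumed model: C_Y = lam_x * d d^H + lam_v * gamma (no additive noise w)."""
    return [[lam_x * d[i] * d[j].conjugate() + lam_v * gamma[i][j]
             for j in range(2)] for i in range(2)]


def estimate_variances(c_y, d, gamma):
    """Estimate (lam_x, lam_v) from a 2x2 covariance estimate c_y.

    d     : assumed target steering vector (length 2)
    gamma : assumed Hermitian reverberation coherence matrix (2x2)
    """
    # Blocking vector b with b^H d = 0, so b^H c_y b contains no target power.
    b = [d[1].conjugate(), -d[0].conjugate()]
    lam_v = (quad_form(b, c_y, b) / quad_form(b, gamma, b)).real

    # MVDR weights w = gamma^{-1} d / (d^H gamma^{-1} d), so that w^H d = 1.
    g_inv_d = solve_2x2(gamma, d)
    norm = sum(d[i].conjugate() * g_inv_d[i] for i in range(2))
    w = [x / norm for x in g_inv_d]

    # Target variance: beamformer output power minus the residual reverberation.
    lam_x = (quad_form(w, c_y, w) - lam_v * quad_form(w, gamma, w)).real

    # Clamp to zero: estimates from noisy sample covariances can go negative.
    return max(lam_x, 0.0), max(lam_v, 0.0)


# Illustrative values only; they are not taken from the patent.
d = [1.0 + 0.0j, 0.8 - 0.3j]
gamma = [[1.0 + 0.0j, 0.4 + 0.0j], [0.4 + 0.0j, 1.0 + 0.0j]]
c_y = model_covariance(2.0, 0.5, d, gamma)
print(estimate_variances(c_y, d, gamma))
```

In practice `c_y` would be a sample covariance averaged over several frames of Y_i(k,m) rather than the exact model covariance used here, so the estimates would only approximate the true variances.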

Bibliographic data

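  • Publication number: US2015256956A1
  • Patent type:
  • Publication date: 2015-09-10
  • Original format: PDF
  • Applicant/assignee: OTICON A/S
  • Application number: US201514640664
  • Inventors: ADAM KUKLASINSKI; JESPER JENSEN
  • Filing date: 2015-03-06
  • Classification: H04R29/00; H04R25/00
  • Country: US
  • Date indexed: 2022-08-21 15:23:58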
