首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming
【24h】

Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming

机译:基于掩码的自适应MVDR波束形成的逐帧闭合表单更新

获取原文

摘要

Beamforming approaches using time-frequency masks have recently been investigated and have shown promising results for noise robust automatic speech recognition (ASR) in many tasks. The time-frequency masks are estimated to compute the spatial statistics of target speech and noise signals, and then the statistics are used to derive a beamformer. Although its effectiveness has been clearly shown in batch and blockwise processing, it has not been well extended to frame-by-frame processing, which is a very important procedure for many actual applications. In this paper, we derive a frame-by-frame update rule for a mask-based minimum variance distortion-less response (MVDR) beamformer, which enables us to obtain enhanced signals without a long delay by combining it with uni-directional recurrent neural network-based mask estimation. Based on the Woodbury matrix identity, our algorithm achieves a closed-form solution of the mask-based MVDR beamformer at every time frame without any matrix inversion. Experimental results show that our frame-by-frame beamformer outperforms baseline block-wise beamforming on the CHiME-3 simulation dataset even with a shorter time delay.
机译:最近已经研究了使用时频掩模的波束成形方法,并在许多任务中显示了对噪声鲁棒自动语音识别(ASR)的有希望的结果。估计时频掩模来计算目标语音和噪声信号的空间统计,然后使用统计来导出波束形成器。虽然它的有效性已被批量和群体处理清楚地显示,但它没有很好地扩展到逐帧处理,这是许多实际应用的一个非常重要的过程。在本文中,我们推导了基于掩模的最小方差失真响应(MVDR)波束形成器的帧逐帧更新规则,这使我们能够通过将其与单向反复性神经组合结合而没有长时间的延迟获得增强的信号基于网络的掩模估计。基于Woodbury矩阵标识,我们的算法在每次帧时达到基于掩模的MVDR波束形成器的闭合溶液,而没有任何矩阵反转。实验结果表明,我们的帧框架波束形成器越突出了基线块 - 明智的波束形成,即使时间延迟较短。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号