Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming

机译：基于掩码的自适应MVDR波束形成的逐帧闭合表单更新

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Beamforming approaches using time-frequency masks have recently been investigated and have shown promising results for noise robust automatic speech recognition (ASR) in many tasks. The time-frequency masks are estimated to compute the spatial statistics of target speech and noise signals, and then the statistics are used to derive a beamformer. Although its effectiveness has been clearly shown in batch and blockwise processing, it has not been well extended to frame-by-frame processing, which is a very important procedure for many actual applications. In this paper, we derive a frame-by-frame update rule for a mask-based minimum variance distortion-less response (MVDR) beamformer, which enables us to obtain enhanced signals without a long delay by combining it with uni-directional recurrent neural network-based mask estimation. Based on the Woodbury matrix identity, our algorithm achieves a closed-form solution of the mask-based MVDR beamformer at every time frame without any matrix inversion. Experimental results show that our frame-by-frame beamformer outperforms baseline block-wise beamforming on the CHiME-3 simulation dataset even with a shorter time delay.

机译：最近已经研究了使用时频掩模的波束成形方法，并在许多任务中显示了对噪声鲁棒自动语音识别（ASR）的有希望的结果。估计时频掩模来计算目标语音和噪声信号的空间统计，然后使用统计来导出波束形成器。虽然它的有效性已被批量和群体处理清楚地显示，但它没有很好地扩展到逐帧处理，这是许多实际应用的一个非常重要的过程。在本文中，我们推导了基于掩模的最小方差失真响应（MVDR）波束形成器的帧逐帧更新规则，这使我们能够通过将其与单向反复性神经组合结合而没有长时间的延迟获得增强的信号基于网络的掩模估计。基于Woodbury矩阵标识，我们的算法在每次帧时达到基于掩模的MVDR波束形成器的闭合溶液，而没有任何矩阵反转。实验结果表明，我们的帧框架波束形成器越突出了基线块 - 明智的波束形成，即使时间延迟较短。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2018年|605p|共5页
会议地点
作者
Takuya Higuchi; Keisuke Kinoshita; Nobutaka Ito; Shigeki Karita; Tomohiro Nakatani;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
Speech enhancement; beamforming; online processing; time-frequency masking;

机译：语音增强;波束成形;在线处理;时间频率掩蔽;

相似文献

外文文献
中文文献
专利

1. Mask-based blind source separation and MVDR beamforming in ASR [J] . Renke He, Yanhua Long, Yijie Li, International journal of speech technology . 2020,第1期

机译：ASR中基于掩模的盲源分离和MVDR波束形成
2. MVDR beamforming and generalized sidelobe cancellation based oninverse updating with residual extraction [J] . Moonen M., Proudler I.K. IEEE Transactions on Circuits and Systems. II, Express Briefs . 2000,第4期

机译：基于残差提取逆更新的MVDR波束形成和广义旁瓣抵消
3. MVDR beamforming and generalized sidelobe cancellation based on inverse updating with residual extraction [J] . Moonen M., Proudler I.K. IEEE Transactions on Circuits and Systems. II . 2000,第4期

机译：基于残差提取逆更新的MVDR波束形成和广义旁瓣抵消
4. Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming [C] . Takuya Higuchi, Keisuke Kinoshita, Nobutaka Ito, IEEE International Conference on Acoustics, Speech and Signal Processing . 2018

机译：基于掩码的自适应MVDR波束形成的逐帧闭合表单更新
5. The Subarray MVDR Beamformer: A Space-Time Adaptive Processor Applied to Active Sonar [D] . Bezanson, Leverett Guidroz. 2013

机译：子阵列MVDR波束形成器：时空自适应处理器应用于Active Sonar
6. Iterative Robust Capon Beamforming with Adaptively Updated Array Steering Vector Mismatch Levels [O] . Tao Zhang, Liguo Sun 2014

机译：具有自适应更新的阵列转向矢量不匹配水平的迭代鲁棒Capon波束成形
7. MVDR beamforming with inverse updating [O] . Moonen M., Proudler I.K. 1998

机译：具有反向更新的MVDR波束成形

Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming

摘要

著录项

相似文献

相关主题

期刊订阅