首页> 外文会议>European Signal Processing Conference >LCMV Beamformer with DNN-Based Multichannel Concurrent Speakers Detector
【24h】

LCMV Beamformer with DNN-Based Multichannel Concurrent Speakers Detector

机译:具有基于DNN的多通道并发扬声器检测器的LCMV波束形成器

获取原文

摘要

Application of the linearly constrained minimum variance (LCMV) beamformer (BF) to speaker extraction tasks in real-life scenarios necessitates a sophisticated control mechanism to facilitate the estimation of the noise spatial cross-power spectral density (cPSD) matrix and the relative transfer function (RTF) of all sources of interest. We propose a deep neural network (DNN)-based multichannel concurrent speakers detector (MCCSD) that utilizes all available microphone signals to detect the activity patterns of all speakers. Time frames classified as no active speaker frames will be utilized to estimate the cPSD, while time frames with a single detected speaker will be utilized for estimating the associated RTF. No estimation will take place during concurrent speaker activity. Experimental results show that the multi-channel approach significantly improves its single-channel counterpart.
机译:在实际场景中将线性约束最小方差(LCMV)波束形成器(BF)应用到说话人提取任务中,需要一种复杂的控制机制来促进噪声空间跨功率谱密度(cPSD)矩阵和相对传递函数的估计(RTF)的所有关注来源。我们提出了一种基于深度神经网络(DNN)的多通道并发扬声器检测器(MCCSD),该检测器利用所有可用的麦克风信号来检测所有扬声器的活动模式。分类为无活动说话者帧的时间帧将用于估计cPSD,而具有单个检测到的说话者的时间帧将用于估计相关的RTF。在并发发言人活动期间不会进行任何估计。实验结果表明,多通道方法显着改善了单通道方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号