首页> 外文会议>IEEE International Workshop on Machine Learning for Signal Processing >A hypothesis testing approach for real-time multichannel speech separation using time-frequency masks
【24h】

A hypothesis testing approach for real-time multichannel speech separation using time-frequency masks

机译:使用时频模板进行实时多通道语音分离的假设测试方法

获取原文

摘要

We propose a new approach to time-frequency mask generation for real-time multichannel speech separation. Whereas conventional approaches select the strongest source in each time-frequency bin, we perform a binary hypothesis test to determine whether a target source is present or not. We derive a generalized likelihood ratio test and extend it to underdetermined mixtures by aggregating the outputs of several tests with different interference models. This approach is justified by the nonstationarity and time-frequency disjointedness of speech signals. This computationally simple method is suitable for real-time source separation in resource-constrained and latency-critical applications.
机译:我们提出了一种用于实时多通道语音分离的时频掩码生成的新方法。常规方法在每个时频点中选择最强的源,而我们执行二元假设检验以确定目标源是否存在。通过汇总具有不同干扰模型的多个测试的输出,我们得出了广义似然比测试,并将其扩展到不确定混合。语音信号的非平稳性和时频脱节性证明了这种方法的合理性。这种计算简单的方法适用于资源受限和延迟关键的应用程序中的实时源分离。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号