...
首页> 外文期刊>Computer speech and language >Online blind speech separation using multiple acoustic speaker tracking and time-frequency masking
【24h】

Online blind speech separation using multiple acoustic speaker tracking and time-frequency masking

机译:使用多个声学扬声器跟踪和时频掩蔽的在线盲语音分离

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Separating speech signals of multiple simultaneous talkers in a reverberant enclosure is known as the cocktail party problem. In real-time applications online solutions capable of separating the signals as they are observed are required in contrast to separating the signals offline after observation. Often a talker may move, which should also be considered by the separation system. This work proposes an online method for speaker detection, speaker direction tracking, and speech separation. The separation is based on multiple acoustic source tracking (MAST) using Bayesian filtering and time-frequency masking. Measurements from three room environments with varying amounts of reverberation using two different designs of microphone arrays are used to evaluate the capability of the method to separate up to four simultaneously active speakers. Separation of moving talkers is also considered. Results are compared to two reference methods: ideal binary masking (IBM) and oracle tracking (O-T). Simulations are used to evaluate the effect of number of microphones and their spacing.
机译:在混响罩中分离多个同时讲话者的语音信号被称为鸡尾酒会问题。在实时应用中,与观察后离线分离信号相反,需要能够在观察时分离信号的在线解决方案。说话者经常会移动,分离系统也应考虑。这项工作提出了一种用于说话人检测,说话人方向跟踪和语音分离的在线方法。分离基于使用贝叶斯滤波和时频掩蔽的多声源跟踪(MAST)。使用两种不同设计的麦克风阵列,在三个具有不同混响量的房间环境中进行测量,以评估该方法分离多达四个同时活动的扬声器的能力。还考虑了移动说话者的分离。将结果与两种参考方法进行比较:理想二进制掩码(IBM)和oracle跟踪(O-T)。模拟用于评估麦克风数量及其间隔的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号