首页> 外文期刊>IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans >Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques
【24h】

Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques

机译:使用多传感器多目标技术的多个移动扬声器的语音分离

获取原文
获取原文并翻译 | 示例
       

摘要

The general problem addressed in this paper is that of separating the voices of active moving speakers in the presence of background noise and moderate reverberation level in the acoustic field using a single microphone array. We adapt the multisensor multitarget tracking theory to the context of microphone arrays in order to form receptive beams that lock on each moving speaker on an extended time basis and therefore, achieve voice separation. Our approach: 1) incorporates kinematical information of speakers' movement by using an interacting multiple model (IMM) estimator per speaker in order to constrain the evolution of direction of arrival (DOA) measurements, which characterize various motions of the speakers, and 2) can directly account for measurement origin uncertainty, i.e., which measurement comes from which speaker, by using the probabilistic-data-association technique in conjunction with the IMM estimator. The effectiveness of the approach is illustrated by an extensive simulation study on tracking the DOAs of two speakers with crossing trajectories and three static speakers having a conversation with partially overlapping speech and long pauses
机译:本文解决的一般问题是使用单个麦克风阵列在存在背景噪声和适度混响水平的情况下在声场中分离主动移动扬声器的声音。我们将多传感器多目标跟踪理论应用于麦克风阵列的环境,以便形成可以在延长的时间基础上锁定在每个移动扬声器上的接收束,从而实现语音分离。我们的方法:1)通过使用每个发言者的交互多模型(IMM)估计器来合并发言者运动的运动学信息,以限制到达方向(DOA)测量的演变,以表征发言者的各种动作,以及2)通过结合使用概率数据关联技术和IMM估计器,可以直接考虑测量起点的不确定性,即哪个测量来自哪个说话者。该方法的有效性通过广泛的模拟研究得到了证明,该研究跟踪了具有交叉轨迹的两个说话者和三个静态说话者的对话的DOA,对话中的会话具有部分重叠的语音和长时间的停顿

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号