Conference: International Conference on Statistical Language and Speech Processing

A Study on Online Source Extraction in the Presence of Changing Speaker Positions

Abstract

Multi-talker speech and moving speakers still pose a significant challenge to automatic speech recognition systems. Assuming an enrollment utterance of the target speaker is available, the so-called SpeakerBeam concept was recently proposed to extract the target speaker from a speech mixture. If multi-channel input is available, spatial properties of the speaker can be exploited to support the source extraction. In this contribution we investigate different approaches to exploiting such spatial information. In particular, we ask how useful this information is if the target speaker changes position. To this end, we present a SpeakerBeam-based source extraction network that is adapted to moving speakers by recursively updating the beamformer coefficients. Experimental results are presented on two data sets, one with artificially created room impulse responses and one with real room impulse responses and noise recorded in a conference room. Interestingly, spatial features turn out to be advantageous even if the speaker position changes.
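The abstract does not say how the beamformer coefficients are updated recursively. As a rough illustration only, one common way to let a mask-based beamformer follow a moving speaker is to smooth the mask-weighted spatial covariance matrices exponentially over time and recompute MVDR weights at every frame. The sketch below assumes exactly that; the function names (`update_scm`, `mvdr_weights`, `online_beamform`), the forgetting factor `forget`, and the use of a time-frequency mask as the network output are all assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def update_scm(scm, frame, mask, forget=0.95):
    """Exponentially smooth a mask-weighted spatial covariance matrix.

    scm:   (F, C, C) complex, previous estimate
    frame: (F, C) complex, multi-channel STFT of the current frame
    mask:  (F,) real in [0, 1], assumed to come from an extraction network
    """
    outer = frame[:, :, None] * frame[:, None, :].conj()  # (F, C, C)
    return forget * scm + (1.0 - forget) * mask[:, None, None] * outer


def mvdr_weights(scm_target, scm_noise, ref_channel=0, eps=1e-6):
    """MVDR weights, w = (Phi_n^-1 Phi_s) u / trace(Phi_n^-1 Phi_s)."""
    n_chan = scm_noise.shape[-1]
    # Diagonal loading keeps the noise covariance invertible.
    loaded = scm_noise + eps * np.eye(n_chan)
    ratio = np.linalg.solve(loaded, scm_target)  # (F, C, C)
    trace = np.trace(ratio, axis1=-2, axis2=-1)[:, None]  # (F, 1)
    return ratio[:, :, ref_channel] / (trace + eps)  # (F, C)


def online_beamform(stft, target_masks, forget=0.95):
    """Frame-by-frame beamforming with recursively updated coefficients.

    stft:         (T, F, C) complex multi-channel mixture
    target_masks: (T, F) real target-speaker masks
    Returns the (T, F) single-channel enhanced STFT.
    """
    n_frames, n_freq, n_chan = stft.shape
    init = 1e-3 * np.tile(np.eye(n_chan, dtype=complex), (n_freq, 1, 1))
    scm_s, scm_n = init.copy(), init.copy()
    out = np.zeros((n_frames, n_freq), dtype=complex)
    for t in range(n_frames):
        # Old statistics decay, so the weights can follow a position change.
        scm_s = update_scm(scm_s, stft[t], target_masks[t], forget)
        scm_n = update_scm(scm_n, stft[t], 1.0 - target_masks[t], forget)
        w = mvdr_weights(scm_s, scm_n)
        out[t] = np.einsum("fc,fc->f", w.conj(), stft[t])  # y = w^H x
    return out
```

In this sketch a smaller `forget` makes the beamformer react faster to a position change at the cost of noisier covariance estimates; the value 0.95 is an arbitrary placeholder, not a setting reported in the paper.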