首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Time Difference of Arrival Estimation of Speech Signals Using Deep Neural Networks with Integrated Time-frequency Masking
【24h】

Time Difference of Arrival Estimation of Speech Signals Using Deep Neural Networks with Integrated Time-frequency Masking

机译:集成时频掩膜的深度神经网络语音信号到达估计的时差

获取原文

摘要

The Time Difference of Arrival (TDoA) of a sound wavefront impinging on a microphone pair carries spatial information about the source. However, captured speech typically contains dynamic non-speech interference sources and noise. Therefore, the TDoA estimates fluctuate between speech and interference. Deep Neural Networks (DNNs) have been applied for Time-Frequency (TF) masking for Acoustic Source Localization (ASL) to filter out non-speech components from a speaker location likelihood function. However, the type of TF mask for this task is not obvious. Secondly, the DNN should estimate the TDoA values, but existing solutions estimate the TF mask instead. To overcome these issues, a direct formulation of the TF masking as a part of a DNN-based ASL structure is proposed. Furthermore, the proposed network operates in an online manner, i.e., producing estimates frame-by-frame. Combined with the use of recurrent layers it exploits the sequential progression of speaker related TDoAs. Training with different microphone spacings allows model re-use for different microphone pair geometries in inference. Real-data experiments with smartphone recordings of speech in interference demonstrate the network's generalization capability.
机译:撞击到麦克风对的声波前的到达时间差(TDoA)携带有关声源的空间信息。但是,捕获的语音通常包含动态的非语音干扰源和噪声。因此,TDoA估计在语音和干扰之间波动。深层神经网络(DNN)已应用于时频(TF)掩盖,以进行声源定位(ASL),以从说话人位置似然函数中滤除非语音成分。但是,用于此任务的TF掩码的类型并不明显。其次,DNN应该估计TDoA值,但是现有解决方案而是估计TF掩码。为了克服这些问题,提出了直接将TF掩蔽公式化为基于DNN的ASL结构的一部分。此外,所提出的网络以在线方式操作,即,逐帧地产生估计。与循环层的使用相结合,它可以利用与说话人相关的TDoA的顺序进行。通过使用不同的麦克风间距进行训练,可以在推理中针对不同的麦克风对几何体重新使用模型。使用智能手机在干扰下录制语音的实时数据实验证明了该网络的泛化能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号