首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets
【24h】

Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets

机译:基于扩散网络的瞬态噪声环境语音活动检测

获取原文
获取原文并翻译 | 示例
           

摘要

We address voice activity detection in acoustic environments of transients and stationary noises, which often occur in real-life scenarios. We exploit unique spatial patterns of speech and non-speech audio frames by independently learning their underlying geometric structure. This process is done through a deep encoder-decoder-based neural network architecture. This structure involves an encoder that maps spectral features with temporal information to their low-dimensional representations, which are generated by applying the diffusion maps method. The encoder feeds a decoder that maps the embedded data hack into the high-dimensional space. A deep neural network, which is trained to separate speech from non-speech frames, is obtained by concatenating the decoder to the encoder, resembling the known diffusion nets architecture. Experimental results show enhanced performance compared to competing voice activity detection methods. The improvement is achieved in both accuracy, robustness, and generalization ability. Our model performs in a real-time manner and can be integrated into audio-based communication systems. We also present a batch algorithm that obtains an even higher accuracy for offline applications.
机译:我们致力于在瞬态和固定噪声的声学环境中进行语音活动检测,这在现实生活中经常发生。通过独立学习它们的基本几何结构,我们可以利用语音和非语音音频帧的独特空间模式。这个过程是通过基于深度编码器-解码器的神经网络架构完成的。该结构涉及一种编码器,该编码器将具有时间信息的频谱特征映射到它们的低维表示,这些维表示是通过应用扩散图方法生成的。编码器提供给解码器,该解码器将嵌入式数据hack映射到高维空间。通过将解码器连接到编码器,获得了经过训练以将语音与非语音帧分离的深度神经网络,类似于已知的扩散网络架构。实验结果表明,与竞争性语音活动检测方法相比,性能得到了增强。准确性,鲁棒性和泛化能力均得到了改善。我们的模型以实时方式执行,并且可以集成到基于音频的通信系统中。我们还提出了一种批处理算法,该算法为离线应用程序提供了更高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号