首页> 外文会议>IEEE International Conference on Rebooting Computing >Multi-microphone voice activity and single-talk detectors based on steered-response power output entropy
【24h】

Multi-microphone voice activity and single-talk detectors based on steered-response power output entropy

机译:基于转向响应功率输出熵的多麦克风语音活动和单通话检测器

获取原文

摘要

Voice activity detection (VAD), namely determining whether a speech signal is active or inactive, and single talk detector (STD), namely detecting that only one speaker is active, are important building blocks in many speech processing applications. A speaker-localization stage (such as the steered response power (SRP)) is often concurrently implemented on the same device.In this paper, the spatial properties of the SRP are utilized for improving the performance of both the voice activity detector (VAD) and the STD. We propose to measure the entropy at the SRP output and compare with the typical entropy of noise-only frames. This feature utilizes spatial information and may therefore become advantageous in nonstationary noise environments. The STD can then be implemented by determining local minimum values of the entropy measure of the SRP.The proposed VAD was tested for a single speaker with two cases, directional background noise with changing level and with a background music source. The proposed STD was tested using real recordings of two concurrent speakers.
机译:在许多语音处理应用中,语音活动检测(VAD)(即确定语音信号是处于活动状态还是非活动状态)和单语音检测器(STD)(即检测到只有一个扬声器处于活动状态)是重要的构成部分。通常在同一设备上同时实现扬声器定位阶段(例如,转向响应功率(SRP))。在本文中,SRP的空间特性被用于改善两个语音活动检测器(VAD)的性能。和性病。我们建议在SRP输出端测量熵,并将其与纯噪声帧的典型熵进行比较。此功能利用空间信息,因此在非平稳噪声环境中可能会变得很有优势。然后可以通过确定SRP熵测度的局部最小值来实现STD。针对具有两种情况的定向单个扬声器测试了建议的VAD,这两种情况是具有变化级别的定向背景噪声和背景音乐源。拟议的性病使用两个并发发言人的真实录音进行了测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号