Conference: IAPR Asian Conference on Pattern Recognition

Simple and Effective Speech Enhancement for Visual Microphone

Abstract

The visual microphone is a technique that recovers sound from a silent video. The simplest way to improve its sound recovery performance is to apply traditional speech enhancement algorithms, which are based on complicated filter designs or sound models. This paper proposes a simple and effective speech enhancement method for the visual microphone (SEVM) that suppresses spectrum components whose amplitude is smaller than a predefined threshold value. The method exploits a property unique to the visual microphone: the spectrum of the recovered sound is relatively high, while the noise spectrum generated by motion estimation error and damped oscillation is relatively low. The proposed SEVM method can also be easily extended to a multichannel case in which multiple speech signals are recovered from multiple cameras. Experimental results show that the proposed SEVM method performs better than the traditional speech enhancement algorithms in terms of log-likelihood ratio (LLR), signal-to-noise ratio (SNR), segmental SNR (SegSNR), and cepstral distance (CEP). From these results, we are convinced that the proposed SEVM method, which is adapted to the visual microphone, is simpler and more effective than traditional speech enhancement methods that are merely applied to the visual microphone as post-processing.
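The thresholding idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `suppress_small_spectrum`, the whole-signal FFT (rather than a framewise transform), and the choice of a threshold relative to the peak magnitude (`rel_threshold`) are all assumptions made here for illustration, and the synthetic tone stands in for a signal recovered from video.

```python
import numpy as np


def suppress_small_spectrum(signal, rel_threshold=0.1):
    """Zero spectral components whose magnitude falls below a threshold.

    Hypothetical helper: the threshold is taken as a fraction of the
    largest spectral magnitude, which is an assumption of this sketch.
    """
    spectrum = np.fft.rfft(signal)
    magnitude = np.abs(spectrum)
    threshold = rel_threshold * magnitude.max()
    # Keep strong components unchanged, set weak ones to zero.
    cleaned = np.where(magnitude >= threshold, spectrum, 0.0)
    return np.fft.irfft(cleaned, n=len(signal))


# Synthetic stand-in for a recovered signal: a tone plus weak broadband noise.
fs = 8000
t = np.arange(fs) / fs
clean = np.sin(2 * np.pi * 440 * t)
rng = np.random.default_rng(0)
noisy = clean + 0.05 * rng.standard_normal(fs)

enhanced = suppress_small_spectrum(noisy, rel_threshold=0.1)

mse_before = np.mean((noisy - clean) ** 2)
mse_after = np.mean((enhanced - clean) ** 2)
print(mse_after < mse_before)
```

In this toy case the tone dominates the spectrum, so the threshold removes almost all of the noise bins. Real recovered speech is broadband and time-varying, so a practical version would likely apply the same rule per short-time frame and tune the threshold accordingly.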