Simple and Effective Speech Enhancement for Visual Microphone

机译：视觉麦克风的简单有效语音增强

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Visual microphone is a technique that recovers the sound from a silent video. The simplest way to improve sound recovery performance of the visual microphone is by applying the traditional speech enhancement algorithms which are based on complicated filter designs or sound models. This paper proposes a simple and effective speech enhancement for visual microphone (SEVM) that suppress spectrum components with small amplitude than a predefined threshold value, which exploits the unique properties that the sound spectrum recovered from the visual microphone is relatively high and the noise spectrum generated motion estimation error and damped oscillation is relatively low. The proposed SEVM method can also be easily extended to a multichannel case that multiple speech signals are recovered from multiple cameras. Experimental results show the proposed SEVM method better performance than the traditional speech enhancement algorithms in terms of log-likelihood ratio (LLR), signal to noise ratio (SNR), segmental SNR (SegSNR) and cepstral distance (CEP). From these results, we convince that the proposed SEVM method that is adapted to the visual microphone is really simple and effective than the traditional speech enhancement methods that are just extended to the visual microphone as a post-processing.

机译：可视麦克风是一种从静音视频中恢复声音的技术。改善视觉麦克风的声音恢复性能的最简单方法是应用基于复杂滤波器设计或声音模型的传统语音增强算法。本文提出了一种简单有效的视觉麦克风语音增强（SEVM）技术，可以抑制振幅小于预定义阈值的频谱分量，该技术利用了从视觉麦克风回收的声音频谱相对较高且产生的噪声频谱的独特特性。运动估计误差和阻尼振荡相对较低。所提出的SEVM方法还可以轻松扩展到从多个摄像机恢复多个语音信号的多通道情况。实验结果表明，提出的SEVM方法在对数似然比（LLR），信噪比（SNR），分段SNR（SegSNR）和倒谱距离（CEP）方面优于传统语音增强算法。从这些结果中，我们相信，与仅作为后处理扩展到可视麦克风的传统语音增强方法相比，适用于可视麦克风的SEVM方法确实非常简单和有效。

著录项

来源
《IAPR Asian Conference on Pattern Recognition》|2017年|694-699|共6页
会议地点 Nanjing(CN)
作者
Juhyun Ahn; Daijin Kim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech enhancement; Visualization; Microphones; Cameras; Noise measurement; Oscillators; Motion estimation;

机译：语音增强；可视化；麦克风；相机；噪声测量；振荡器；运动估计;

相似文献

外文文献
中文文献
专利

1. Evaluation of speech reception threshold in noise in young Cochlear? Nucleus ? system 6 implant recipients using two different digital remote microphone technologies and a speech enhancement sound processing algorithm [J] . Sergio Razza, Monica Zaccone, Aannalisa Meli, International journal of pediatric otorhinolaryngology . 2017,第期

机译：在年轻耳蜗噪声中评估语音接收阈值？核心？系统6使用两种不同的数字远程麦克风技术和语音增强声处理算法的植入接收者
2. Evaluation of Speech Recognition of Cochlear Implant Recipients Using Adaptive, Digital Remote Microphone Technology and a Speech Enhancement Sound Processing Algorithm [J] . Wolfe Jace, Morais Mila, Schafer Erin, Journal of the American Academy of Audiology . 2015,第5期

机译：使用自适应数字远程麦克风技术和语音增强声音处理算法评估人工耳蜗接收者的语音识别
3. Speech Enhancement Using Compact Microphone Array and Applications in Distant Speech Acquisition [J] . ZHANG Heng, FU Qiang, YAN Yonghong 电子学报：英文版 . 2009,第003期

机译：使用紧凑型麦克风阵列进行语音增强及其在远程语音采集中的应用
4. Simple and Effective Speech Enhancement for Visual Microphone [C] . Juhyun Ahn, Daijin Kim IAPR Asian Conference on Pattern Recognition . 2017

机译：视觉麦克风简单有效的语音增强
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor [O] . Yi Zhou, Yufan Chen, Yongbao Ma, 2020

机译：骨传导传感器辅助的实时双麦克风语音增强算法
7. Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones [O] . Randall Ali, Toon van Waterschoot, Marc Moonen 2021

机译：校正：使用本地麦克风阵列和外部麦克风的语音增强集成MVDR波束形成器

Simple and Effective Speech Enhancement for Visual Microphone

摘要

著录项

相似文献

相关主题

期刊订阅