首页> 外文会议>INTERSPEECH 2012 >Voice Activity Detection Using Speech Recognizer Feedback

【24h】

Voice Activity Detection Using Speech Recognizer Feedback

机译：语音活动检测使用语音识别器反馈

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper demonstrates how feedback from a speech recognizer can be leveraged to improve Voice Activity Detection (VAD) for online speech recognition. First, reliably transcribed segments of audio are fed back by the recognizer as supervision for VAD model adaptation. This allows the much stronger LVCSR acoustic models to be harnessed without adding computation. Second, when to make a VAD decision is dictated by the recognizer not the VAD module, allowing an implicit dynamic look-ahead for VAD. This improves robustness but can be gracefully reduced to meet latency requirements if necessary without requiring retraining/retuning of the VAD module. Experiments on telephone conversations yielded a 6.7% abs. reduction in frame classification error rate when feedback was applied to HMM-based VAD and a 4.2% abs. reduction over the best baseline system. Furthermore, a 3.0% abs. WER reduction was achieved over the best baseline in speech recognition experiments.

机译：本文演示了如何利用语音识别器的反馈，以改善在线语音识别的语音活动检测（VAD）。首先，通过识别器作为VAD模型适应的监督，可靠地转录的音频转换段。这允许在不添加计算的情况下利用更强大的LVCSR声学模型。其次，当识别器而不是VAD模块决定时，何时制作VAD决定，允许VAD的隐式动态寻找。这改善了稳健性，但如果需要，可以优雅地减少以满足延迟要求，而无需再培训/重新定量VAD模块。电话交谈的实验产生了6.7％ABS。当反馈应用于基于HMM的VAD和4.2％ABS时，帧分类错误率的降低。减少最好的基线系统。此外，ABS 3.0％。在语音识别实验中最好的基线实现了减少。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Kit Thambiratnam; Weiwu Zhu; Frank Seide;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
voice activity detection (VAD); speech segmentation; speech recognition;

机译：语音活动检测（VAD）;语音分割;语音识别;

相似文献

外文文献
中文文献
专利

1. Speech enhancement through voice activity detection using speech absence probability based on Teager energy [J] . PARKYun-sik, LEE Sang-min 中南大学学报（英文版） . 2013,第002期

机译：通过基于Teager能量的语音缺失概率通过语音活动检测进行语音增强
2. Hidden-Markov-model-based voice activity detector with high speech detection rate for speech enhancement [J] . Veisi H., Sameti H. Signal Processing, IET . 2012,第1期

机译：具有高语音检测率的基于隐马尔可夫模型的语音活动检测器，用于语音增强
3. Voice activity detection in noise using modulation spectrum of speech: Investigation of speech frequency and modulation frequency ranges [J] . Kimhuoch Pek, Takayuki Arai, Noboru Kanedera Acoustical science and technology . 2012,第1期

机译：使用语音调制频谱检测噪声中的语音活动：语音频率和调制频率范围的研究
4. Voice Activity Detection Using Speech Recognizer Feedback [C] . Kit Thambiratnam, Weiwu Zhu, Frank Seide Annual conference of the International Speech Communication Association . 2012

机译：使用语音识别器反馈进行语音活动检测
5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D] . Tao, Fei. 2018

机译：用于鲁棒语音活动检测和自动语音识别的视听语音处理方面的进展
6. A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement [O] . Yan Zhang, Zhen-min Tang, Yan-ping Li, -1

机译：语音活动检测和语音增强的分层框架方法
7. Pitch extraction and voiced/unvoiced detection of speech by cross-coupling multi-layered neural network with feedback architecture [O] . Hideo Miyabayashi, Tetsuo Funada 1997

机译：通过具有反馈架构的交叉耦合多层神经网络，俯仰提取和浊音/清晰的语音检测

Voice Activity Detection Using Speech Recognizer Feedback

摘要

著录项

相似文献

相关主题

期刊订阅