首页> 外文会议> >Unsupervised Speech/Non-Speech Detection for Automatic Speech Recognition in Meeting Rooms

【24h】

Unsupervised Speech/Non-Speech Detection for Automatic Speech Recognition in Meeting Rooms

机译：会议室自动语音识别的无监督语音/非语音检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The goal of this work is to provide robust and accurate speech detection for automatic speech recognition (ASR) in meeting room settings. The solution is based on computing long-term modulation spectrum, and examining specific frequency range for dominant speech components to classify speech and non-speech signals for a given audio signal. Manually segmented speech segments, short-term energy, short-term energy and zero-crossing based segmentation techniques, and a recently proposed multi layer perceptron (MLP) classifier system are tested for comparison purposes. Speech recognition evaluations of the segmentation methods are performed on a standard database and tested in conditions where the signal-to-noise ratio (SNR) varies considerably, as in the cases of close-talking headset, lapel, distant microphone array output, and distant microphone. The results reveal that the proposed method is more reliable and less sensitive to mode of signal acquisition and unforeseen conditions

机译：这项工作的目标是为会议室设置中的自动语音识别（ASR）提供可靠而准确的语音检测。该解决方案基于计算长期调制频谱，并检查主要语音分量的特定频率范围，以对给定音频信号进行语音和非语音信号分类。为了进行比较，测试了手动分段的语音分段，短期能量，基于短期能量和零交叉的分段技术以及最近提出的多层感知器（MLP）分类器系统。在标准数据库上执行对分割方法的语音识别评估，并在信噪比（SNR）发生较大变化的条件下进行测试，例如在近距离交谈耳机，翻领，远距离麦克风阵列输出和远距离麦克风的情况下麦克风。结果表明，所提出的方法对信号采集模式和不可预见的条件更加可靠，灵敏度更低。

著录项

来源
《》|2007年|1037-1040|共4页
会议地点
作者
Maganti; H.K.; Motlicek; P.; Gatica-Perez; D.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
acoustic signal detection; architectural acoustics; multilayer perceptrons; speech processing; speech recognition; SNR; automatic recognition; automatic speech recognition; meeting rooms; multi layer perceptron; nonspeech signals; segmentation methods; signal acquisi;

机译：声学信号检测;建筑声学;多层感知器;语音处理;语音识别; SNR;自动识别;自动语音识别;会议室;多层感知器;非语音信号;分段方法;信号获取;

相似文献

外文文献
中文文献
专利

1. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J] . Shimada Kazuki, Bando Yoshiaki, Mimura Masato, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第5期

机译：基于多通道NMF信息波束形成的无监督语音增强技术，用于强噪声自动语音识别
2. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J] . Shimada Kazuki, Bando Yoshiaki, Mimura Masato, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第5期

机译：基于多通道NMF的噪声强度自动语音识别的无监督语音增强
3. The relationship between speech recognition in noise and non-speech recognition in noise test performances: Implications for central auditory processing disorders testing [J] . Vermiglio Andrew J., Velappan Keerthana, Heeke Paige, Journal of communication disorders . 2019,第期

机译：噪声测试表演中噪声识别与非语音识别的关系：中央听觉处理障碍测试的影响
4. Unsupervised Speech/Non-Speech Detection for Automatic Speech Recognition in Meeting Rooms [C] . Maganti H.K., Motlicek P., Gatica-Perez D. . -1

机译：会议室自动语音识别的无监督语音/非语音检测
5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D] . Tao, Fei. 2018

机译：用于鲁棒语音活动检测和自动语音识别的视听语音处理方面的进展
6. A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech [O] . László Tóth, Ildikó Hoffmann, Gábor Gosztolya, -1

机译：基于语音识别的自发性语音自动检测轻度认知障碍的解决方案
7. Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms [O] . Hari Krishna Maganti, Petr Motlicek, Daniel Gatica-perez 2007

机译：用于会议室中自动语音识别的无监督语音/非语音检测
8. Recognition of Isolated Non-Speech Sounds [R] . Ballas, J. A. 1987

机译：识别孤立的非语音声音

Unsupervised Speech/Non-Speech Detection for Automatic Speech Recognition in Meeting Rooms

摘要

著录项

相似文献

相关主题

期刊订阅