首页> 外文会议>IEE Colloquium on Applied Statistical Process Control, 1990 >The effect of speech and audio compression on speech recognitionperformance

【24h】

The effect of speech and audio compression on speech recognitionperformance

机译：语音和音频压缩对语音识别性能的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes an in-depth look at the influence of differentspeech and audio codecs on the performance of our continuous speechrecognition engine. GSM full rate, G711, G723.1 and MPEG coders areinvestigated. It is shown that MPEG transcoding degrades the speechrecognition performance for low bitrates whereas performance remainsacceptable for specialized speech coders like GSM or G711. A newstrategy is proposed to cope with degradation due to low bitrate coding.The acoustic models of the speech recognition system are trained withtranscoded speech (one acoustic model for each speech/audio codec).First results show that this strategy allows one to recover acceptableperformance

机译：本文提出了对不同影响的深入研究语音和音频编解码器对我们连续语音的性能影响识别引擎。 GSM全速率，G711，G723.1和MPEG编码器调查。结果表明，MPEG转码会降低语音质量低比特率的识别性能，而性能保持不变专用语音编码器（如GSM或G711）可以接受。一个新的提出了一种策略来应对由于低比特率编码而引起的降级。语音识别系统的声学模型经过训练转码语音（每个语音/音频编解码器的一个声学模型）。初步结果表明，该策略可以使人们恢复可接受的水平表现

著录项

来源
《IEE Colloquium on Applied Statistical Process Control, 1990 》|1990年|p.301-306|共6页
会议地点
作者
Besacier L.; Bergamini C.; Vaufreydaz D.; Castelli E.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. The cortical representation of the speech envelope is earlier for audiovisual speech than audio speech [J] . Michael J. Crosse Edmund C. Lalor Journal of Neurophysiology . 2014 ,第4期

机译：对于视听语音，语音包络的皮质表示早于音频语音
2. The cortical representation of the speech envelope is earlier for audiovisual speech than audio speech [J] . Michael J. Crosse Edmund C. Lalor Journal of Neurophysiology . 2014 ,第4期

机译：语音信封的皮质代表性比音频语音更早用于视听演讲
3. No, There Is No 150 ms Lead of Visual Speech on Auditory Speech, but a Range of Audiovisual Asynchronies Varying from Small Audio Lead to Large Audio Lag [J] . Jean-Luc Schwartz, Christophe Savariaux PLoS Computational Biology . 2014 ,第7期

机译：不，听觉语音没有150 ms的视觉语音引导，但是视听异步范围从小音频导致大音频滞后
4. The effect of speech and audio compression on speech recognition performance [C] . Besacier, L., Bergamini, . 2001

机译：语音和音频压缩对语音识别性能的影响
5. Lossless audio compression of speech and voice. [D] . Liu, Yang. 2002

机译：语音和语音的无损音频压缩。
6. No There Is No 150 ms Lead of Visual Speech on Auditory Speech but a Range of Audiovisual Asynchronies Varying from Small Audio Lead to Large Audio Lag [O] . Jean-Luc Schwartz, Christophe Savariaux 2014

机译：不听觉语音没有150 ms的视觉语音导联但是视听异步范围从小音频导联到大音频滞后
7. The Effect Of Speech And Audio Compression On Speech Recognition Performance [O] . Laurent Besacier, Carole Bergamini, Dominique Vaufreydaz, 2001

机译：语音和音频压缩对语音识别性能的影响

The effect of speech and audio compression on speech recognitionperformance

摘要

著录项

相似文献

相关主题

期刊订阅