首页> 外文会议>IEE Colloquium on Applied Statistical Process Control, 1990 >The effect of speech and audio compression on speech recognitionperformance
【24h】

The effect of speech and audio compression on speech recognitionperformance

机译:语音和音频压缩对语音识别性能的影响

获取原文

摘要

This paper proposes an in-depth look at the influence of differentspeech and audio codecs on the performance of our continuous speechrecognition engine. GSM full rate, G711, G723.1 and MPEG coders areinvestigated. It is shown that MPEG transcoding degrades the speechrecognition performance for low bitrates whereas performance remainsacceptable for specialized speech coders like GSM or G711. A newstrategy is proposed to cope with degradation due to low bitrate coding.The acoustic models of the speech recognition system are trained withtranscoded speech (one acoustic model for each speech/audio codec).First results show that this strategy allows one to recover acceptableperformance
机译:本文提出了对不同影响的深入研究 语音和音频编解码器对我们连续语音的性能影响 识别引擎。 GSM全速率,G711,G723.1和MPEG编码器 调查。结果表明,MPEG转码会降低语音质量 低比特率的识别性能,而性能保持不变 专用语音编码器(如GSM或G711)可以接受。一个新的 提出了一种策略来应对由于低比特率编码而引起的降级。 语音识别系统的声学模型经过训练 转码语音(每个语音/音频编解码器的一个声学模型)。 初步结果表明,该策略可以使人们恢复可接受的水平 表现

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号