首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >On the characteristics effects of loudness during utterance production in continuous speech recognition
【24h】

On the characteristics effects of loudness during utterance production in continuous speech recognition

机译:在连续语音识别中发声生产中响度的特点和影响

获取原文

摘要

We have checked out that, in speech recognition based telephone applications, the loudness with which the speech signal is produced is a source of degradation of the word accuracy if it is lower or higher than normal. For this reason, we havecarried out a research work which has reached three goals: (a) get a better understanding of the Speech Production Loudness (SPL) phenomenon, (b) find out the parameters of the speech recognizer that are the most affected by loudness variations, and (c)compute the effects of SPL and whispery speech in Large Vocabulary Continuous Speech Recognition (LVCSR). In this paper we report the results of this study for three different loudnesses (low, normal and high) and whispery speech. We also report the wordaccuracy degradation of a continuous speech recognition system when the speech production loudness is different than normal as well as the degradation for whispery speech. The study has been done using the TRESYOL Spanish database, that was designed tostudy, evaluate and compensate the effect of loudness and whispery speech in LVCSR systems.
机译:我们已经检查过,在基于语音识别的电话应用中,如果它较低或高于正常,则产生语音信号的响度是字精度的劣化源。出于这个原因,我们还有一个达到三个目标的研究工作:(a)更好地了解语音生产响度(SPL)现象,(b)找出最受影响最大的语音识别器的参数响度变化,和(c)计算SPL和潜言语音在大词汇连续语音识别(LVCSR)中的影响。在本文中,我们向三种不同的响度(低,正常和高)和耳语的演讲报告了本研究的结果。我们还报告了当语音生产响度与正常情况不同的持续语音识别系统的语音劣化,以及潜言语音的劣化。该研究已经使用Tresyol西班牙数据库进行了完成,该数据库被设计得扭曲,评估和补偿LVCSR系统中的响度和耳语的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号