首页> 外文会议>International Conference on Creative Content Technologies >RPKOM-GEN: A System for Testing Speech Recognition in Adverse Acoustic Conditions Using Speech Synthesis
【24h】

RPKOM-GEN: A System for Testing Speech Recognition in Adverse Acoustic Conditions Using Speech Synthesis

机译:RPKOM-GEN:使用语音合成在不利声学条件下测试语音识别的系统

获取原文

摘要

Training and testing of current state-of-the-art speech recognition systems require huge speech databases whose creation is time-consuming and expensive. This paper presents a novel approach for testing speech recognition in adverse acoustic conditions that uses speech synthesis, which facilitates optimizing and adjusting speech recognition to various environmental conditions. RPKOM-GEN is a complex system of multiple synthesizers that generates synthetic speech and testing signals with well defined characteristics. It might be used to produce public announcements, sets of utterances for spoken dialogue systems or other speech excerpts. The acoustic parameters of synthetic voices, such as speech rate, pitch, intensity, and others, can be pre-defined from a broad range of options. By using this novel technique, the system can also vary vocal effort imitating thus the Lombard effect and so-called long-distance speech. It is also possible to model the characteristics of the transmission channel since the system includes noise generators and digital effects such as the setting of environmental noise or reverberation levels. The paper presents the system architecture, describes graphical user interface and a rich array of usage possibilities, and discusses the results of pilot experiments testing the effect of added noise on speech recognition accuracy.
机译:目前最先进的语音识别系统的培训和测试需要巨大的语音数据库,其创建是耗时和昂贵的。本文介绍了一种用于测试使用语音合成的不良声学条件中的语音识别的新方法,这有利于优化和调整语音识别到各种环境条件。 RPKOM-GEN是一种复杂的多个合成器系统,可产生具有良好定义特性的合成语音和测试信号。它可能用于发布公告,用于口语对话系统或其他演讲摘录的话语。合成声音的声学参数,例如语音率,俯仰,强度等,可以从广泛的选择中预定。通过使用这种新颖的技术,该系统还可以改变模仿伦巴第效应和所谓的长距离语音的声乐效果。由于该系统包括噪声发生器和数字效果,例如环境噪声或混响级别的设置,因此还可以模拟传输信道的特性。本文提出了系统架构,描述了图形用户界面和丰富的使用量,并讨论了试验实验的结果测试了噪声对语音识别准确性的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号