首页> 外文会议>Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2012 9th International Conference on >Effects of acoustic mismatches on speech recognition accuracies due to playback-recorded speech corpus
【24h】

Effects of acoustic mismatches on speech recognition accuracies due to playback-recorded speech corpus

机译:声音不匹配对回放记录的语音语料库造成的语音识别准确性的影响

获取原文
获取原文并翻译 | 示例

摘要

Modern speech recognition techniques rely on large amount of speech data whose acoustic characteristics match with the operating environments to train their acoustic models. Gathering training data from loudspeakers playing recorded speech utterances are far more practical than from human speakers. This paper presents results from speech recognition experiments providing practical insights on effects caused by utterances re-recorded form loudspeakers. A clean-speech corpus of sixty human speakers was built using two different microphones and their playbacks were re-recorded. Results show that, with minimal lexical constraints, accuracies degraded for playback-trained system, even with no mismatches between training and test data. However, mismatches did not affect cases with tighter high-level constraints, such as number and limited-vocabulary word recognitions. A procedure to reduce mismatches caused by constructing corpus from playbacks was introduced. The procedure was shown to make the accuracy of a playback-trained system 48% closer to the one of the system trained with speech in matched environment.
机译:现代语音识别技术依赖于其声学特性与操作环境相匹配的大量语音数据来训练其声学模型。从播放录制的语音的扬声器中收集训练数据比从人类扬声器中收集训练数据要实用得多。本文介绍了语音识别实验的结果,提供了对扬声器重新录制的话语所造成的影响的实用见解。使用两个不同的麦克风构建了一个由60位人类说话者组成的清晰语音的语料库,并重新录制了其回放内容。结果表明,在最小的词法约束下,即使在训练和测试数据之间没有不匹配的情况下,对于经过回放训练的系统,准确性也会下降。但是,不匹配不会影响具有更严格的高级约束的案件,例如数量和词汇限制词识别。介绍了减少因播放而构造语料库而导致的不匹配的过程。结果表明,该程序可使经过回放训练的系统的准确度比在匹配环境中经过语音训练的系统的准确度高48%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号