Conference: International Symposium on Natural Language Processing

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

Abstract

We describe the recent development of the NECTEC Thai open-domain automatic speech recognition system. Techniques found beneficial over the baseline system include: hybrid word-subword language modeling, to enhance vocabulary coverage under constrained resources; multi-condition noisy acoustic modeling, to improve system robustness using a newly developed large social media speech database; recent state-of-the-art speech features; and online decoding and speech compression, to reduce processing and data transmission time. Together, these techniques yield a 32.4% word error rate on open-domain noisy speech test sets, a 35.7% relative reduction from the baseline system. The overall system runs at an average of 1.2xRT, which is promising for real applications.
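
The two headline figures can be unpacked with a little arithmetic. The sketch below is illustrative only: the abstract does not state the baseline word error rate, so the value computed here is merely what the 32.4% and 35.7% figures imply, and it assumes "1.2xRT" denotes a real-time factor (processing time divided by audio duration). The function names and the 10-second utterance are hypothetical.

```python
def implied_baseline_wer(new_wer: float, relative_reduction: float) -> float:
    """Solve new_wer = baseline * (1 - relative_reduction) for baseline."""
    return new_wer / (1.0 - relative_reduction)


def processing_time(audio_seconds: float, real_time_factor: float) -> float:
    """Processing time under the assumption RTF = processing time / audio duration."""
    return audio_seconds * real_time_factor


# Figures taken from the abstract: 32.4% WER, 35.7% relative reduction.
baseline = implied_baseline_wer(32.4, 0.357)
print(f"Implied baseline WER: {baseline:.1f}%")  # about 50.4%

# Hypothetical 10-second utterance at the reported average of 1.2xRT.
print(f"Processing time: {processing_time(10.0, 1.2):.1f} s")
```

Under these assumptions, the baseline would have sat near 50% word error rate, and a factor of 1.2 means processing takes slightly longer than the audio itself.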