首页> 外文会议>International Conference on Asian Language Processing >N-gram Language Model Based on Multi-Word Expressions in Web Documents for Speech Recognition and Closed-Captioning
【24h】

N-gram Language Model Based on Multi-Word Expressions in Web Documents for Speech Recognition and Closed-Captioning

机译:Web文档中基于多词表达的N元语法模型用于语音识别和隐藏字幕

获取原文

摘要

Automatic speech recognition technique is generally used to align the closed caption text to video data. It is important to increase the speech recognition accuracy for the accurate closed-captioning. This paper proposes the method for constructing N-gram language model based on multi word expressions (MWEs) from web retrieval results to improve the speech recognition performance. The web retrieval experiment for examining the distribution of web count numbers for MWEs and the speech recognition experiment for investigating the effectiveness of MWEs are conducted. The experimental results show that the proposed method can improve the recognition performance and the closed-captioning accuracy.
机译:自动语音识别技术通常用于将隐藏字幕文本与视频数据对齐。对于准确的隐藏式字幕来说,提高语音识别的准确性非常重要。提出了一种基于网页检索结果的基于多词表达(MWE)的N元语法模型的构建方法,以提高语音识别性能。进行了用于检查MWE的Web计数数量分布的Web检索实验和用于调查MWE有效性的语音识别实验。实验结果表明,该方法可以提高识别性能和隐藏字幕的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号