Journal: Systems and Computers in Japan

Recognition of Speech from Live Sports Coverage Using Acoustic and Language Model Adaptation


Abstract

In this paper, we use large-vocabulary continuous speech recognition to detect highlight scenes in live radio coverage of baseball games and to extract keywords for indexing. To make the speech recognition unit more robust, we adapt the acoustic models with maximum likelihood linear regression (MLLR) and maximum a posteriori (MAP) adaptation, and we propose a two-level adaptation procedure that combines supervised and unsupervised adaptation. Acoustic model adaptation lets us adapt to the announcer's speaking style and the recording environment, improving word accuracy by 28% over the baseline. For the language model, we performed adaptation via language model fusion, created classes for players' and commentators' names, and adjusted the pronunciation dictionary, improving word accuracy by 13% over the baseline. Combined adaptation of both models improved word accuracy by 38%.
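The abstract does not give the adaptation equations, so the following is only a minimal sketch of the standard MAP update for a single Gaussian mean, not the authors' implementation. The function name `map_adapt_mean`, the prior weight `tau`, and its default value are assumptions made for illustration. The adapted mean is a weighted average of the prior (speaker-independent) mean and the adaptation data, where each frame is weighted by the occupation probability of the Gaussian.

```python
def map_adapt_mean(prior_mean, frames, posteriors, tau=10.0):
    """Standard MAP update of one Gaussian mean (illustrative sketch).

    prior_mean: mean of the unadapted model, as a list of floats
    frames:     adaptation feature vectors, each a list of floats
    posteriors: occupation probability of this Gaussian for each frame
    tau:        prior weight (hypothetical default); larger values keep
                the result closer to prior_mean
    """
    dim = len(prior_mean)
    occupancy = sum(posteriors)

    # Accumulate the posterior-weighted sum of the adaptation frames.
    weighted_sum = [0.0] * dim
    for frame, gamma in zip(frames, posteriors):
        for d in range(dim):
            weighted_sum[d] += gamma * frame[d]

    # Interpolate between the prior mean and the data statistics.
    return [
        (tau * prior_mean[d] + weighted_sum[d]) / (tau + occupancy)
        for d in range(dim)
    ]


if __name__ == "__main__":
    adapted = map_adapt_mean(
        prior_mean=[0.0, 0.0],
        frames=[[2.0, 4.0], [4.0, 8.0]],
        posteriors=[1.0, 1.0],
        tau=2.0,
    )
    print(adapted)
```

With little adaptation data the result stays near the prior mean, and with more data it moves toward the data mean. This is one common reason for pairing MAP with a transform-based method such as MLLR, which can update all Gaussians from a small amount of data.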
