首页> 外文会议>9th International conference on language resources and evaluation >Exploiting the large-scale German Broadcast Corpus to boost the Fraunhofer IAIS Speech Recognition System
【24h】

Exploiting the large-scale German Broadcast Corpus to boost the Fraunhofer IAIS Speech Recognition System

机译:利用大型德国广播语料库来增强Fraunhofer IAIS语音识别系统

获取原文

摘要

In this paper we describe the large-scale German broadcast corpus (GER-TV1000h) containing more than 1,000 hours of transcribed speech data. This corpus is unique in the German language corpora domain and enables significant progress in tuning the acoustic modelling of German large vocabulary continuous speech recognition (LVCSR) systems. The exploitation of this huge broadcast corpus is demonstrated by optimizing and improving the Fraunhofer IAIS speech recognition system. Due to the availability of huge amount of acoustic training data new training strategies are investigated. The performance of the automatic speech recognition (ASR) system is evaluated on several datasets and compared to previously published results. It can be shown that the word error rate (WER) using a larger corpus can be reduced by up to 9.1 % relative. By using both larger corpus and recent training paradigms the WER was reduced by up to 35.8 % relative and below 40 % absolute even for spontaneous dialectal speech in noisy conditions, making the ASR output a useful resource for subsequent tasks like named entity recognition also in difficult acoustic situations.
机译:在本文中,我们描述了包含超过1000个小时的转录语音数据的大型德国广播语料库(GER-TV1000h)。该语料库在德语语料库领域中是独一无二的,并且在调整德国大型词汇连续语音识别(LVCSR)系统的声学模型方面取得了重大进展。通过优化和改进Fraunhofer IAIS语音识别系统,证明了对这一庞大的广播语料库的利用。由于可获得大量的声学训练数据,因此研究了新的训练策略。自动语音识别(ASR)系统的性能在多个数据集上进行了评估,并与先前发布的结果进行了比较。可以证明,使用较大语料库的单词错误率(WER)相对最多可降低9.1%。通过同时使用较大的语料库和最近的训练范例,即使在嘈杂的条件下自发的方言语音,WER相对值也可降低高达35.8%,绝对值低于40%,这使得ASR输出可用于后续任务(如命名实体识别)的有用资源声学情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号