Annual conference of the International Speech Communication Association

PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds

Abstract

This paper presents a language-model training method for improving the automatic transcription of online spoken content. Unlike previously studied large-vocabulary continuous speech recognition (LVCSR) tasks such as broadcast news and lectures, large task-specific corpora for training language models cannot be prepared and used for recognition because of the diversity of topics, vocabularies, and speaking styles. To overcome the difficulty of preparing such task-specific language models in advance, we propose collaborative training of language models on the basis of the wisdom of crowds. On PodCastle, our public web service for LVCSR-based spoken document retrieval, anonymous users have corrected over half a million recognition errors. By leveraging these corrected transcriptions, component language models for various topics can be built and dynamically mixed to generate an appropriate language model for each podcast episode in an unsupervised manner. Experimental results with Japanese podcasts showed that the mixed language models significantly reduced the word error rate.
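The abstract does not say how the component language models are mixed. As an illustration only, the sketch below assumes linear interpolation with weights estimated by EM on a first-pass recognition result, a common way to adapt a mixture without supervision. The unigram models, the toy topic data, and the names `train_unigram`, `estimate_weights`, and `mix` are hypothetical and are not taken from the paper.

```python
from collections import Counter


def train_unigram(tokens, vocab, alpha=1.0):
    """Add-alpha smoothed unigram model over a fixed vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def estimate_weights(components, adaptation_tokens, iterations=20):
    """EM estimate of linear-interpolation weights.

    adaptation_tokens stands in for an unsupervised first-pass transcript
    of one episode (an assumption); every token must be in the vocabulary.
    """
    k = len(components)
    weights = [1.0 / k] * k
    for _ in range(iterations):
        expected = [0.0] * k
        for w in adaptation_tokens:
            # E-step: posterior responsibility of each component for w.
            joint = [weights[i] * components[i][w] for i in range(k)]
            norm = sum(joint)
            for i in range(k):
                expected[i] += joint[i] / norm
        # M-step: new weights are the average responsibilities.
        weights = [e / len(adaptation_tokens) for e in expected]
    return weights


def mix(components, weights, vocab):
    """Linearly interpolate the component models with the given weights."""
    return {w: sum(lam * m[w] for lam, m in zip(weights, components))
            for w in vocab}


# Hypothetical toy data: two topic corpora built from corrected transcripts.
sports = "goal match team goal player match".split()
tech = "software update phone software app update".split()
vocab = sorted(set(sports) | set(tech))
components = [train_unigram(sports, vocab), train_unigram(tech, vocab)]

# Hypothetical first-pass recognition output for one episode.
first_pass = "software phone update team app".split()
weights = estimate_weights(components, first_pass)
mixed = mix(components, weights, vocab)
```

In this toy setup the episode text is mostly about technology, so the technology component receives the larger weight. A real system would use n-gram models trained on far more text.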
