首页> 外文会议>Spoken Language Technology Workshop >Innovative Bert-Based Reranking Language Models for Speech Recognition
【24h】

Innovative Bert-Based Reranking Language Models for Speech Recognition

机译:基于创新的BERT的Reranking语言模型进行语音识别

获取原文

摘要

More recently, Bidirectional Encoder Representations from Transformers (BERT) was proposed and has achieved impressive success on many natural language processing (NLP) tasks such as question answering and language understanding, due mainly to its effective pre-training then fine-tuning paradigm as well as strong local contextual modeling ability. In view of the above, this paper presents a novel instantiation of the BERT-based contextualized language models (LMs) for use in reranking of N-best hypotheses produced by automatic speech recognition (ASR). To this end, we frame N-best hypothesis reranking with BERT as a prediction problem, which aims to predict the oracle hypothesis that has the lowest word error rate (WER) given the N-best hypotheses (denoted by PBERT). In particular, we also explore to capitalize on task-specific global topic information in an unsupervised manner to assist PBERT in N-best hypothesis reranking (denoted by TPBERT). Extensive experiments conducted on the AMI benchmark corpus demonstrate the effectiveness and feasibility of our methods in comparison to the conventional autoregressive models like the recurrent neural network (RNN) and a recently proposed method that employed BERT to compute pseudo-log-likelihood (PLL) scores for N-best hypothesis reranking.
机译:最近,提出了来自变压器(BERT)的双向编码器表示,并在许多自然语言处理(NLP)任务中取得了令人印象深刻的成功,如问题应答和语言理解,主要是其有效的预训练,然后进行微调范式作为强大的本地语境建模能力。鉴于上述情况,本文介绍了基于BERT的上下文化语言模型(LMS)的新颖实例,用于重新划分由自动语音识别(ASR)产生的N最佳假设。为此,我们将n最佳假设重新登记为伯特作为预测问题,这旨在预测给定具有最低单词错误率(WER)的Oracle假设(由Pbert表示)。特别是,我们还探讨了对特定任务的全球性主题的信息利用在无人监督的方式来帮助PBERT在N最佳假设重新排名(由TPBERT表示)。在AMI基准语料库上进行的广泛实验表明了我们的方法的有效性和可行性与传统的神经网络(RNN)等传统自回归模型相比,以及使用BERT计算伪日志似然(PLL)分数的最近提出的方法对于n最佳假设重新评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号