IEEE International Conference on Acoustics, Speech and Signal Processing

Converting Neural Network Language Models into back-off language models for efficient decoding in automatic speech recognition

Abstract

Neural Network Language Models (NNLMs) have achieved very good performance in large-vocabulary continuous speech recognition (LVCSR) systems. Because decoding with NNLMs is very computationally expensive, there is interest in developing methods to approximate NNLMs with simpler language models that are suitable for fast decoding. In this work, we propose an approximate method for converting a feedforward NNLM into a back-off n-gram language model that can be used directly in existing LVCSR decoders. We convert NNLMs of increasing order to pruned back-off language models, using lower-order models to constrain the n-grams allowed in higher-order models. In experiments on Broadcast News data, we find that the resulting back-off models retain the bulk of the gain achieved by NNLMs over conventional n-gram language models, and give significant accuracy improvements as compared to existing methods for converting NNLMs to back-off models. In addition, the proposed approach can be applied to any type of non-back-off language model to enable efficient decoding.
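
The abstract only outlines the conversion at a high level. The sketch below illustrates one way the order-by-order growth with lower-order constraints could look in code. It is a toy illustration under stated assumptions, not the paper's algorithm: `nnlm_prob`, the vocabulary, and the pruning threshold are hypothetical placeholders, the NNLM is replaced by a uniform stand-in so the script runs on its own, and back-off weight estimation is omitted.

```python
# Hypothetical sketch: grow pruned back-off n-gram tables of increasing order,
# where n-grams admitted at order k may only extend histories retained at
# order k-1, and probabilities are read off an NNLM query function.
from collections import defaultdict

VOCAB = ["<s>", "</s>", "the", "news", "broadcast", "tonight"]

def nnlm_prob(word, history):
    """Stand-in for a feedforward NNLM query P(word | history).

    A real system would run the network; here we return a uniform
    distribution so the sketch is self-contained and runnable.
    """
    return 1.0 / len(VOCAB)

def convert_nnlm_to_backoff(max_order=3, prune_threshold=1e-4):
    """Build back-off tables order by order.

    Returns a dict: order -> {history tuple -> {word -> probability}}.
    Lower-order tables constrain which histories are expanded at the
    next order, mirroring the growth strategy sketched in the abstract.
    """
    tables = {}
    # Order 1: unigrams straight from the NNLM with an empty history.
    tables[1] = {(): {w: nnlm_prob(w, ()) for w in VOCAB}}

    for order in range(2, max_order + 1):
        tables[order] = defaultdict(dict)
        # Only histories whose final word survived pruning at the previous
        # order are extended; this is the lower-order constraint.
        for hist, dist in tables[order - 1].items():
            for last_word, p in dist.items():
                if p < prune_threshold:
                    continue  # pruned entry: spawns no higher-order n-grams
                new_hist = (hist + (last_word,))[-(order - 1):]
                for w in VOCAB:
                    q = nnlm_prob(w, new_hist)
                    if q >= prune_threshold:
                        tables[order][new_hist][w] = q
    return tables

if __name__ == "__main__":
    lm = convert_nnlm_to_backoff(max_order=3)
    for order, table in lm.items():
        n_entries = sum(len(d) for d in table.values())
        print(f"order {order}: {n_entries} retained n-grams")
```

A complete conversion would additionally renormalize the retained probabilities, estimate back-off weights for each history, and write the result in ARPA format so that an existing LVCSR decoder can load it directly; those steps are left out of this sketch for brevity.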
