首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Fast Oov Words Incorporation Using Structured Word Embeddings for Neural Network Language Model
【24h】

Fast Oov Words Incorporation Using Structured Word Embeddings for Neural Network Language Model

机译:FAST OOV单词使用结构化单词嵌入式用于神经网络语言模型

获取原文

摘要

Recently, deep learning approaches have been widely used in language modeling and achieved great success. However, the out-of-vocabulary (OOV) words are often estimated in a rather crude way using only one special symbol, which ignores the linguistic information. In this paper we present an LSTM language model with structured word embeddings to tackle this problem. In our model, both input and output embeddings of LSTM language model are deployed with structured word embeddings. Utilizing syntactic-level and morphological-level parameters sharing, OOV words can be incorporated into the proposed model without retraining. The LSTM language model with structured word embeddings is instantiated for Chinese. Experiments show that the proposed model achieves PPL improvement on OOV words, and can be further integrated into automatic speech recognition systems for fast vocabulary updating.
机译:最近,深入学习方法已被广泛用于语言建模,取得了巨大的成功。然而,只使用一个特殊符号以相当粗略的方式估计失败的词汇(OOV)单词,这忽略了语言信息。在本文中,我们介绍了一个具有结构化词嵌入的LSTM语言模型来解决这个问题。在我们的模型中,LSTM语言模型的两个输入和输出嵌入式都会使用结构化单词嵌入式部署。利用句法级和形态学级参数共享,OOV单词可以在没有再培训的情况下纳入所提出的模型。具有结构化单词嵌入式的LSTM语言模型是为了中文而实例化。实验表明,所提出的模型实现了对OOV字的PPL改进,并且可以进一步集成到用于快速词汇更新的自动语音识别系统中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号