Slim Embedding Layers for Recurrent Neural Language Models

机译：用于复发性神经语言模型的纤薄嵌入层

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recurrent neural language models are the state-of-the-art models for language modeling. When the vocabulary size is large, the space taken to store the model parameters becomes the bottleneck for the use of recurrent neural language models. In this paper, we introduce a simple space compression method that randomly shares the structured parameters at both the input and output embedding layers of the recurrent neural language models to significantly reduce the size of model parameters, but still compactly represent the original input and output embedding layers. The method is easy to implement and tune. Experiments on several data sets show that the new method can get similar perplexity and BLEU score results while only using a very tiny fraction of parameters.

机译：经常性的神经语言模型是语言建模的最先进模型。当词汇大小很大时，存储模型参数所采取的空间成为使用经常性神经语言模型的瓶颈。在本文中，我们介绍了一个简单的空间压缩方法，随机分享了经常性神经语言模型的输入和输出嵌入层的结构化参数，以显着降低模型参数的大小，但仍然是最初的输入和输出嵌入层。该方法易于实现和调整。几个数据集的实验表明，新方法可以获得类似的困惑和BLEU得分结果，同时仅使用非常小的参数分数。

著录项

来源
《AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence》|2018年|4629-5745p|共9页
会议地点
作者
Zhongliang Li; Raymond Kulhanek; Yunxin Zhao; Shaojun Wang; Shuang Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. A Hybrid Language Model Based on a Recurrent Neural Network and Probabilistic Topic Modeling [J] . M. S. Kudinov, A. A. Romanenko Pattern recognition and image analysis: advances in mathematical theory and applications in the USSR . 2016,第3期

机译：基于递归神经网络和概率主题建模的混合语言模型
2. Improving Recurrent Neural Networks for Offline Arabic Handwriting Recognition by Combining Different Language Models [J] . Jemni Sana Khamekhem, Kessentini Yousri, Kanoun Slim International Journal of Pattern Recognition and Artificial Intelligence . 2020,第12期

机译：通过组合不同的语言模型，改进反际阿拉伯语手写识别的反复性神经网络
3. Recurrent Neural Network based Language Modeling for Punjabi ASR [J] . Vaibhav Kumar International Journal of Computer Science and Engineering . 2020,第9期

机译：基于旁遮普ASR的经常性神经网络的语言建模
4. Slim Embedding Layers for Recurrent Neural Language Models [C] . Zhongliang Li, Raymond Kulhanek, Yunxin Zhao, AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence . 2018

机译：用于复发性神经语言模型的纤薄嵌入层
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D] . Hassan, Abdalraouf. 2018

机译：基于卷积神经网络和递归神经网络的深度神经语言文本分类模型
6. Neural Systems Language: A Formal Modeling Language for the Systematic Description Unambiguous Communication and Automated Digital Curation of Neural Connectivity [O] . Ramsay A. Brown, Larry W. Swanson -1

机译：神经系统语言：用于系统描述明确通信和神经连接自动数字化的形式化建模语言
7. Joint Online Spoken Language Understanding and Language Modeling with Recurrent Neural Networks [O] . Liu, Bing, Lane, Ian 2016

机译：联合在线口语语言理解与语言建模递归神经网络

Slim Embedding Layers for Recurrent Neural Language Models

摘要

著录项

相似文献

相关主题

期刊订阅