Annual Meeting of the Association for Computational Linguistics

Compressing Neural Language Models by Sparse Word Representations



Abstract

Neural networks are among the state-of-the-art techniques for language modeling. Existing neural language models typically map discrete words to distributed, dense vector representations. After the preceding context words are processed by hidden layers, an output layer estimates the probability of the next word. Such approaches are time- and memory-intensive because of the large number of parameters in the word embeddings and the output layer. In this paper, we propose to compress neural language models by sparse word representations. In our experiments, the number of parameters in the model increases very slowly, almost imperceptibly, as the vocabulary grows. Moreover, our approach not only reduces the parameter space to a large extent, but also improves performance in terms of the perplexity measure.
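To make the storage argument in the abstract concrete, here is a minimal sketch of the general idea the title suggests: a small set of common words keeps full dense embeddings, and every additional word is stored only as a sparse code over those base embeddings. This is an illustration under assumed names and sizes (embed_dim, num_base_words, nnz_per_word, sparse_code_for_rare_word), not the authors' actual training procedure, in which the sparse codes would be learned rather than random.

```python
import numpy as np

rng = np.random.default_rng(0)

embed_dim = 8          # dimensionality of dense word vectors (illustrative)
num_base_words = 100   # common words that keep full dense embeddings
nnz_per_word = 4       # non-zero coefficients allowed per additional word

# Dense embedding matrix for the base (common) vocabulary.
base_embeddings = rng.normal(size=(num_base_words, embed_dim))

def sparse_code_for_rare_word():
    """Hypothetical sparse code: a few base-word indices plus coefficients.
    Random here; in the paper such codes would be learned (e.g. by sparse coding)."""
    idx = rng.choice(num_base_words, size=nnz_per_word, replace=False)
    coef = rng.normal(size=nnz_per_word)
    return idx, coef

def embed_rare_word(idx, coef):
    """Reconstruct a dense vector as a sparse linear combination of base embeddings."""
    return coef @ base_embeddings[idx]

# Storage cost of adding one more word to the vocabulary:
dense_params = embed_dim          # a full dense embedding row
sparse_params = 2 * nnz_per_word  # (index, coefficient) pairs only
print(f"dense params per word: {dense_params}, sparse params per word: {sparse_params}")

idx, coef = sparse_code_for_rare_word()
print("reconstructed embedding:", embed_rare_word(idx, coef))
```

Under this scheme the per-word cost no longer scales with the embedding dimensionality, which is one way the parameter count can grow only marginally as the vocabulary expands.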


