Annual Conference on Neural Information Processing Systems

Learning word embeddings efficiently with noise-contrastive estimation



Abstract

Continuous-valued word embeddings learned by neural language models have recently been shown to capture semantic and syntactic information about words very well, setting performance records on several word similarity tasks. The best results are obtained by learning high-dimensional embeddings from very large quantities of data, which makes scalability of the training method a critical factor. We propose a simple and scalable new approach to learning word embeddings based on training log-bilinear models with noise-contrastive estimation. Our approach is simpler, faster, and produces better results than the current state-of-the-art method. We achieve results comparable to the best ones reported, which were obtained on a cluster, using four times less data and more than an order of magnitude less computing time. We also investigate several model types and find that the embeddings learned by the simpler models perform at least as well as those learned by the more complex ones.
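The abstract describes training a log-bilinear model with noise-contrastive estimation (NCE): the observed word in a context is treated as a positive example, k words drawn from a noise distribution as negatives, and a logistic regression distinguishes the two using the difference between the model score and log(k·Pₙ(w)). A minimal sketch of one such update, with toy sizes, a uniform noise distribution, and illustrative variable names (none of this is the authors' actual code):

```python
import numpy as np

rng = np.random.default_rng(0)
V, D, K = 50, 16, 5  # toy vocab size, embedding dim, noise samples per datum

# Log-bilinear model: score of word w in a context is the dot product of
# the context representation with w's target embedding.
Q = 0.1 * rng.standard_normal((V, D))  # target embeddings
R = 0.1 * rng.standard_normal((V, D))  # context embeddings
unigram = np.full(V, 1.0 / V)          # noise distribution P_n (uniform here)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def nce_step(context_ids, target_id, lr=0.5):
    """One NCE update: the observed word is a positive example, K draws
    from P_n are negatives; classify via score - log(K * P_n(w))."""
    ctx = R[context_ids].mean(axis=0)               # context representation
    noise_ids = rng.choice(V, size=K, p=unigram)
    words = np.concatenate(([target_id], noise_ids))
    labels = np.concatenate(([1.0], np.zeros(K)))   # 1 = data, 0 = noise
    scores = Q[words] @ ctx - np.log(K * unigram[words])
    p = sigmoid(scores)                             # P(data | word, context)
    grad = labels - p                               # gradient of NCE objective
    # SGD updates (np.add.at accumulates correctly if a noise id repeats)
    np.add.at(Q, words, lr * grad[:, None] * ctx)
    dctx = (grad[:, None] * Q[words]).sum(axis=0)
    R[context_ids] += lr * dctx / len(context_ids)
    # mean NCE log-likelihood term (higher is better)
    return np.mean(labels * np.log(p + 1e-12)
                   + (1 - labels) * np.log(1 - p + 1e-12))

# Toy run: repeating one (context, target) pair should raise the objective
# as the model learns to separate it from noise samples.
before = nce_step([1, 2], 3)
for _ in range(200):
    after = nce_step([1, 2], 3)
```

The key scalability point from the abstract shows up here: each update touches only K+1 rows of the embedding tables, so the cost per example is independent of the vocabulary size, unlike a full softmax.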


