International Conference on Text, Speech and Dialogue

Continuous Distributed Representations of Words as Input of LSTM Network Language Model


Abstract

The continuous skip-gram model is an efficient algorithm for learning high-quality distributed vector representations that capture a large number of syntactic and semantic word relationships. Artificial neural networks have become the state of the art in language modelling, and Long Short-Term Memory (LSTM) networks in particular appear to be an efficient architecture to train. In this paper, we carry out experiments with a combination of these powerful models: continuous distributed representations of words are trained with the skip-gram method on a large corpus and used as the input of an LSTM language model instead of the traditional 1-of-N coding. The possibilities of this approach are shown in perplexity experiments on the Wikipedia and Penn Treebank corpora.
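To make the setup concrete, the following is a minimal sketch of feeding pre-trained skip-gram vectors to an LSTM language model in place of 1-of-N (one-hot) input. The abstract does not specify an implementation, so PyTorch, gensim, the toy corpus, and every hyperparameter below are placeholder assumptions rather than the authors' configuration.

    import torch
    import torch.nn as nn
    from gensim.models import Word2Vec

    # Step 1: train skip-gram (sg=1) word vectors. The paper trains on a
    # large corpus; this two-sentence toy list only illustrates the API.
    sentences = [["the", "cat", "sat"], ["the", "dog", "sat"]]
    w2v = Word2Vec(sentences, vector_size=50, sg=1, min_count=1, window=2)

    # Step 2: copy the skip-gram vectors into an embedding layer, so the
    # LSTM receives dense distributed representations instead of sparse
    # 1-of-N codes. freeze=True is an assumption: the abstract does not
    # say whether the vectors are fine-tuned during LM training.
    vocab = w2v.wv.index_to_key
    vectors = torch.tensor(w2v.wv.vectors)
    embedding = nn.Embedding.from_pretrained(vectors, freeze=True)

    class LSTMLanguageModel(nn.Module):
        def __init__(self, embedding, hidden_size=128):
            super().__init__()
            self.embedding = embedding
            self.lstm = nn.LSTM(embedding.embedding_dim, hidden_size,
                                batch_first=True)
            self.decoder = nn.Linear(hidden_size, embedding.num_embeddings)

        def forward(self, token_ids):
            x = self.embedding(token_ids)  # dense skip-gram inputs
            out, _ = self.lstm(x)
            return self.decoder(out)       # next-word logits per position

    model = LSTMLanguageModel(embedding)
    ids = torch.tensor([[vocab.index(w) for w in ["the", "cat"]]])
    print(model(ids).shape)  # torch.Size([1, 2, vocab_size])

Perplexity, the evaluation metric named in the abstract, would then be the exponential of the average cross-entropy of these next-word logits against the actual next tokens.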
