Evaluating the Ability of LSTMs to Learn Context-Free Grammars

机译：评估LSTM学习无上下文语法的能力

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

While long short-term memory (LSTM) neural net architectures are designed to capture sequence information, human language is generally composed of hierarchical structures. This raises the question as to whether LSTMs can learn hierarchical structures. We explore this question with a well-formed bracket prediction task using two types of brackets modeled by an LSTM.Demonstrating that such a system is learnable by an LSTM is the first step in demonstrating that the entire class of CFLs is also learnable. We observe that the model requires exponential memory in terms of the number of characters and embedded depth, where a sub-linear memory should suffice.Still, the model does more than memorize the training input. It learns how to distinguish between relevant and irrelevant information. On the other hand, we also observe that the model does not generalize well.We conclude that LSTMs do not learn the relevant underlying context-free rules, suggesting the good overall performance is attained rather by an efficient way of evaluating nuisance variables. LSTMs are a way to quickly reach good results for many natural language tasks, but to understand and generate natural language one has to investigate other concepts that can make more direct use of natural language's structural nature.

机译：虽然长短期记忆（LSTM）神经网络体系结构旨在捕获序列信息，但人类语言通常由层次结构组成。这就提出了关于LSTM是否可以学习层次结构的问题。我们将使用LSTM建模的两种类型的括号，通过格式正确的括号预测任务来探讨此问题。证明LSTM可学习这样的系统是证明整个CFL也是可学习的第一步。我们观察到该模型在字符数和嵌入深度方面需要指数存储，其中亚线性存储就足够了。尽管如此，该模型的作用还不仅仅在于记忆训练输入。它学习如何区分相关信息和无关信息。另一方面，我们还观察到该模型不能很好地推广。我们得出的结论是LSTM没有学习相关的基础上下文无关规则，这表明可以通过评估扰动变量的有效方法来获得良好的总体性能。 LSTM是一种在许多自然语言任务中快速达到良好结果的方法，但是要理解和生成自然语言，人们必须研究其他可以更直接利用自然语言结构性质的概念。

著录项

来源
《1st EMNLP workshop blackboxNLP: analyzing and interpreting neural networks for NLP 2018》|2018年|115-124|共10页
会议地点 Brussels(BE)
作者
Luzi Sennhauser; Robert C. Berwick;
展开▼
作者单位

Federal Institute of Technology Zurich, Switzerland Massachusetts Institute of Technology Cambridge, MA, USA;

LIDS, Room 32-D728 Massachusetts Institute of Technology Cambridge, MA, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Probabilistic learnability of context-free grammars with basic distributional properties from positive examples [J] . Shibata Chihiro, Yoshinaka Ryo Theoretical computer science . 2016,第Null期

机译：从正例中获取具有基本分布特性的上下文无关文法的概率可学习性
2. Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction [J] . Robin D Dowell, Sean R Eddy BMC Bioinformatics . 2004,第1期

机译：RNA二级结构预测的几种轻量级随机上下文无关文法的评估
3. Optimal probabilistic evaluation functions for search controlled by stochastic context-free grammars [J] . Corazza A., De Mori R. IEEE Transactions on Pattern Analysis and Machine Intelligence . 1994,第10期

机译：随机上下文无关文法控制的搜索的最佳概率评估函数
4. Evaluating the Ability of LSTMs to Learn Context-Free Grammars [C] . Luzi Sennhauser, Robert C. Berwick Conference on empirical methods in natural language processing . 2018

机译：评估LSTMS学习无背景语法的能力
5. TRANSPOSITION GRAMMARS AND PRODUCTION PROBABILITY ESTIMATORS FOR CONTEXT-FREE GRAMMARS (AUTOMATA, STATISTICS, RANDOM WALKS). [D] . HUMENIK, KEITH. 1985

机译：无上下文语法（自动，统计，随机漫步）的移位语法和生产概率估计。
6. Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction [O] . Robin D Dowell, Sean R Eddy 2004

机译：RNA二级结构预测的几种轻量级随机上下文无关文法的评估
7. Evaluating the Ability of LSTMs to Learn Context-Free Grammars [O] . Luzi Sennhauser, Robert Berwick 2018

机译：评估LSTMS学习无背景语法的能力

Evaluating the Ability of LSTMs to Learn Context-Free Grammars

摘要

著录项

相似文献

相关主题

期刊订阅