Conference on Empirical Methods in Natural Language Processing

Interpreting Word-Level Hidden State Behaviour of Character-Level LSTM Language Models



Abstract

While Long Short-Term Memory networks (LSTMs) and other forms of recurrent neural network have been successfully applied to language modeling on a character level, the hidden state dynamics of these models can be difficult to interpret. We investigate the hidden states of such a model by using the HDBSCAN clustering algorithm to identify points in the text at which the hidden state is similar. Focusing on whitespace characters prior to the beginning of a word reveals interpretable clusters that offer insight into how the LSTM may combine contextual and character-level information to identify parts of speech. We also introduce a method for deriving word vectors from the hidden state representation in order to investigate the word-level knowledge of the model. These word vectors encode meaningful semantic information even for words that appear only once in the training text.
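The analysis pipeline the abstract describes can be sketched as follows. This is a minimal, hedged illustration only: the per-character hidden states are mocked with random vectors rather than produced by an actual character-level LSTM, the clustering step (HDBSCAN in the paper) is omitted, and averaging a word's character states into a word vector is one plausible reading of the method, not the paper's confirmed procedure.

```python
# Sketch of the abstract's analysis steps. Assumptions (not from the paper):
# hidden states are mocked with random vectors; word vectors are formed by
# averaging the states over a word's characters.
import numpy as np


def mock_hidden_states(text, dim=8, seed=0):
    """Stand-in for the per-character hidden states of a char-level LSTM."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal((len(text), dim))


def whitespace_states(text, states):
    """Collect hidden states at whitespace characters preceding each word,
    i.e. the points the paper clusters (with HDBSCAN) to find interpretable
    groupings."""
    idx = [i for i, ch in enumerate(text) if ch == " "]
    return states[idx]


def word_vectors(text, states):
    """Derive one vector per word by averaging the hidden states over that
    word's characters (an illustrative choice of aggregation)."""
    vecs, start = {}, 0
    for i, ch in enumerate(text + " "):  # sentinel space flushes last word
        if ch == " ":
            if i > start:
                vecs[text[start:i]] = states[start:i].mean(axis=0)
            start = i + 1
    return vecs


text = "the cat sat"
states = mock_hidden_states(text)
ws = whitespace_states(text, states)   # shape: (num_spaces, dim)
wv = word_vectors(text, states)        # {"the": ..., "cat": ..., "sat": ...}
```

In the paper's setting, `mock_hidden_states` would be replaced by a forward pass of the trained LSTM, and `ws` would be fed to a clusterer; the point of the sketch is only the indexing logic of extracting states at pre-word whitespace positions.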
