Character and Subword-Based Word Representation for Neural Language Modeling Prediction

First Workshop on Subword and Character Level Models in NLP, 2017

Abstract

Most neural language models use different kinds of embeddings for word prediction. While word embeddings can be associated with each word in the vocabulary or derived from characters as well as from a factored morphological decomposition, these word representations are mainly used to parametrize the input, i.e. the context of prediction. This work investigates the effect of using subword units (characters and a factored morphological decomposition) to build output representations for neural language modeling. We present a case study on Czech, a morphologically rich language, experimenting with different input and output representations. When working with the full training vocabulary, despite unstable training, our experiments show that augmenting the output word representations with character-based embeddings can significantly improve the performance of the model. Moreover, reducing the size of the output look-up table, to let the character-based embeddings represent rare words, brings further improvement.
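To make the idea concrete, the following is a minimal PyTorch sketch, not the authors' implementation, of an output layer whose word representations combine a word look-up table with a character-based embedding; restricting the look-up table to the most frequent words leaves rare words to the character component alone. The module name, the BiLSTM character encoder, and parameters such as `char_hidden` and `freq_cutoff` are illustrative assumptions.

```python
import torch
import torch.nn as nn


class CharAugmentedOutput(nn.Module):
    """Scores the next word with output representations that mix a word
    embedding (frequent words only) and a character-LSTM embedding.
    A sketch of the idea in the abstract, under assumed hyperparameters."""

    def __init__(self, vocab_size, char_vocab_size, dim,
                 char_dim=32, char_hidden=64, freq_cutoff=None):
        super().__init__()
        # Only the `freq_cutoff` most frequent words keep a row in the
        # output look-up table; rare words rely on characters alone.
        self.freq_cutoff = freq_cutoff or vocab_size
        self.word_out = nn.Embedding(vocab_size, dim)
        self.char_emb = nn.Embedding(char_vocab_size, char_dim, padding_idx=0)
        self.char_rnn = nn.LSTM(char_dim, char_hidden, batch_first=True,
                                bidirectional=True)
        self.char_proj = nn.Linear(2 * char_hidden, dim)

    def word_representations(self, word_ids, char_ids):
        """word_ids: (V,) word indices; char_ids: (V, max_word_len)
        character indices for each word in the output vocabulary."""
        # Character-based embedding: BiLSTM over each word's characters,
        # keeping the final hidden state of both directions.
        emb = self.char_emb(char_ids)                   # (V, L, char_dim)
        _, (h, _) = self.char_rnn(emb)                  # h: (2, V, char_hidden)
        char_repr = self.char_proj(torch.cat([h[0], h[1]], dim=-1))
        # Word-embedding component, masked out for rare words.
        word_repr = self.word_out(word_ids)
        keep = (word_ids < self.freq_cutoff).unsqueeze(-1).float()
        return word_repr * keep + char_repr             # (V, dim)

    def forward(self, hidden, word_ids, char_ids):
        """hidden: (batch, dim) context states from the language model;
        returns unnormalised next-word scores over the vocabulary."""
        out_repr = self.word_representations(word_ids, char_ids)
        return hidden @ out_repr.t()                    # (batch, V)
```

Setting `freq_cutoff` to the full vocabulary size corresponds to augmenting every output word representation with its character-based embedding, while a smaller cutoff corresponds to the reduced look-up table variant described in the abstract.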
