2017 IEEE International Conference on Big Knowledge

Revisit Word Embeddings with Semantic Lexicons for Modeling Lexical Contrast


Abstract

It is widely accepted that traditional word embedding models, which rely on the distributional semantics hypothesis, are of limited use for modeling lexical contrast. The distributional hypothesis states that words occurring in similar contexts have similar representations in vector space. However, synonyms and antonyms often occur in similar contexts, which means they end up close to each other in vector space, making it difficult to distinguish antonyms from synonyms. To address this challenge, we propose an optimization model named the Lexicon-based Word Embedding Tuning (LWET) model. The goal of LWET is to incorporate reliable semantic lexicons to tune the distribution of pre-trained word embeddings in vector space, so as to improve their ability to distinguish antonyms from synonyms. To speed up the training of LWET, we propose two approximation algorithms: positive sampling and quasi-hierarchical softmax. Positive sampling is faster than quasi-hierarchical softmax, but at the cost of worse performance. In our experiments, LWET and other state-of-the-art models are evaluated on antonym recognition, antonym-synonym discrimination, and word similarity. The results of the first two tasks show that LWET significantly improves the ability of word embeddings to detect antonyms, achieving state-of-the-art performance. On word similarity, LWET performs slightly better than the state-of-the-art models, which indicates that when tuning word distributions in vector space, LWET preserves and even strengthens the semantic structure rather than destroying it. Overall, compared with related work, LWET achieves similar or better performance while speeding up the training process.
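The abstract does not spell out LWET's objective function, so the Python sketch below only illustrates the general idea under stated assumptions: starting from pre-trained vectors, synonym pairs drawn from a semantic lexicon are pulled together, antonym pairs are pushed apart, and a regularization term keeps each vector near its pre-trained position, mirroring the abstract's claim that the semantic structure is preserved rather than destroyed. The function `tune_embeddings` and all hyperparameters are hypothetical, and the paper's positive-sampling and quasi-hierarchical-softmax speedups are not reproduced here.

```python
import random

import numpy as np


def tune_embeddings(emb, synonyms, antonyms,
                    lr=0.05, reg=0.1, epochs=10, seed=0):
    """Hypothetical lexicon-based tuning sketch (not the paper's exact
    LWET objective): attract synonym pairs, repel antonym pairs, and
    regularize each vector toward its pre-trained position."""
    # Work on unit vectors so dot products are cosine similarities.
    orig = {w: v / np.linalg.norm(v) for w, v in emb.items()}
    vecs = {w: v.copy() for w, v in orig.items()}
    pairs = ([(a, b, +1.0) for a, b in synonyms] +
             [(a, b, -1.0) for a, b in antonyms])
    rng = random.Random(seed)
    for _ in range(epochs):
        rng.shuffle(pairs)
        for a, b, sign in pairs:
            if a not in vecs or b not in vecs:
                continue  # skip lexicon words missing from the embeddings
            va, vb = vecs[a], vecs[b]
            # Move a toward b for synonyms (sign=+1) and away for antonyms
            # (sign=-1); the reg term pulls back toward the original vector.
            vecs[a] = va + lr * (sign * vb + reg * (orig[a] - va))
            vecs[b] = vb + lr * (sign * va + reg * (orig[b] - vb))
            vecs[a] /= np.linalg.norm(vecs[a])
            vecs[b] /= np.linalg.norm(vecs[b])
    return vecs


# Toy usage: after tuning, "hot" should be closer to "warm" than to "cold".
gen = np.random.default_rng(0)
emb = {w: gen.standard_normal(50) for w in ["hot", "warm", "cold", "chilly"]}
tuned = tune_embeddings(emb,
                        synonyms=[("hot", "warm"), ("cold", "chilly")],
                        antonyms=[("hot", "cold"), ("warm", "chilly")])
print(tuned["hot"] @ tuned["warm"], tuned["hot"] @ tuned["cold"])
```

Staying on the unit sphere keeps dot products interpretable as cosine similarity throughout tuning. This sketch makes a full pass over every lexicon pair each epoch; the paper's two approximation algorithms exist precisely to avoid that cost by approximating the update with sampled terms.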