Towards Understanding Linear Word Analogies

机译：理解线性词类比

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A surprising property of word vectors is that word analogies can often be solved with vector arithmetic. However, it is unclear why arithmetic operators correspond to non-linear embedding models such as skip-gram with negative sampling (SGNS). We provide a formal explanation of this phenomenon without making the strong assumptions that past theories have made about the vector space and word distribution. Our theory has several implications. Past work has conjectured that linear substructures exist in vector spaces because relations can be represented as ratios; we prove that this holds for SGNS. We provide novel justification for the addition of SGNS word vectors by showing that it automatically down-weights the more frequent word, as weighting schemes do ad hoc. Lastly, we offer an information theoretic interpretation of Euclidean distance in vector spaces, justifying its use in capturing word dissimilarity.

机译：词向量的一个令人惊讶的特性是，通常可以使用向量算术来解决词的类比。但是，尚不清楚为什么算术运算符对应于非线性嵌入模型，例如带有负采样的跳gram（SGNS）。我们对这种现象进行了形式上的解释，而没有做出过去理论对向量空间和单词分布所做的有力假设。我们的理论有几个含义。过去的工作推测向量空间中存在线性子结构，因为关系可以表示为比率。我们证明这适用于SGNS。通过显示SGNS词向量的自动加权，如加权方案确实，我们可以自动降低较频繁的词的权重，从而为添加SGNS词向量提供了新颖的理由。最后，我们提供了向量空间中欧几里得距离的信息理论解释，证明了它在捕获单词不相似性方面的合理性。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2019年|3253-3262|共10页
会议地点
作者
Kawin Ethayarajh; David Duvenaud; Graeme Hirst;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Cross-lingual word analogies using linear transformations between semantic spaces [J] . Brychcin Tomas, Taylor Stephen, Svoboda Lukas Expert Systems with Application . 2019,第NOVa期

机译：使用语义空间之间的线性变换进行跨语言类比
2. Cross-lingual word analogies using linear transformations between semantic spaces [J] . Brychcin Tomas, Taylor Stephen, Svoboda Lukas Expert systems with applications . 2019,第Nova期

机译：使用语义空间之间的线性变换的交叉语言类比
3. An Investigation of the Types of Student-Generated Analogies,the Mapping Understanding,and the Mapping Errors in Concept Learning on the Reaction Rate with Generating Analogy [J] . Kyungsun Kim, Sunyoung Hwang, Taehee Noh Journal of the Korean Chemical Society . 2008,第4期

机译：学生生成类比的类型，映射理解以及概念类学习中生成类比反应率的映射错误的调查
4. Towards Understanding Linear Word Analogies [C] . Kawin Ethayarajh, David Duvenaud, Graeme Hirst Annual meeting of the Association for Computational Linguistics . 2019

机译：了解线性词语类比
5. Playing with a double-edged sword: Analogies in biochemistry. [D] . Orgill, MaryKay. 2003

机译：玩一把双刃剑：生物化学类比。
6. The assembly of amyloidogenic yeast sup35 as assessed by scanning (atomic) force microscopy: an analogy to linear colloidal aggregation? [O] . S Xu, B Bevis, M F Arnsdorf 2001

机译：通过扫描（原子）显微镜观察淀粉样生成酵母sup35的组装：与线性胶体聚集类似吗？
7. Towards Understanding Linear Word Analogies [O] . Kawin Ethayarajh, David Duvenaud, Graeme Hirst 2019

机译：了解线性词语类比

Towards Understanding Linear Word Analogies

摘要

著录项

相似文献

相关主题

期刊订阅