Eighth Joint Conference on Lexical and Computational Semantics

Beyond Context: A New Perspective for Word Embeddings


Abstract

Most word embeddings today are trained by optimizing a language modeling goal of scoring words in their context, modeled as a multi-class classification problem. Despite the successes of this assumption, it is incomplete: in addition to its context, orthographic or morphological aspects of words can offer clues about their meaning. In this paper, we define a new modeling framework for training word embeddings that captures this intuition. Our framework is based on the well-studied problem of multi-label classification and, consequently, exposes several design choices for featurizing words and contexts, loss functions for training, and score normalization. Indeed, standard models such as CBOW and fastText are specific choices along each of these axes. We show via experiments that by combining feature engineering with embedding learning, our method can outperform CBOW using only 10% of the training data, in both the standard word embedding evaluations and also text classification experiments.
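The contrast the abstract draws can be sketched concretely: a CBOW-style model scores a target word against the whole vocabulary with a softmax (multi-class), while the multi-label view scores each candidate word independently with a sigmoid. The toy sketch below, with a hypothetical four-word vocabulary and random vectors, is only an illustration of the two scoring regimes, not the paper's actual model; a fastText-style featurization would additionally sum character n-gram vectors into each word's representation.

```python
import math
import random

random.seed(0)
vocab = ["the", "cat", "sat", "mat"]
dim = 8

def rand_vec():
    return [random.uniform(-0.5, 0.5) for _ in range(dim)]

# One input (context) and one output (target) vector per word.
# A fastText-style model would also sum character n-gram vectors
# (e.g. "<ca", "cat", "at>") into each word's representation.
emb_in = {w: rand_vec() for w in vocab}
emb_out = {w: rand_vec() for w in vocab}

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def context_vector(context):
    """Average the input vectors of the context words (CBOW-style)."""
    vecs = [emb_in[w] for w in context]
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]

def cbow_scores(context):
    """Multi-class view (CBOW): softmax over the whole vocabulary,
    so the scores of all words compete and sum to 1."""
    h = context_vector(context)
    logits = {w: dot(emb_out[w], h) for w in vocab}
    m = max(logits.values())
    exps = {w: math.exp(l - m) for w, l in logits.items()}
    z = sum(exps.values())
    return {w: e / z for w, e in exps.items()}

def multilabel_scores(context):
    """Multi-label view: an independent sigmoid per word, so several
    targets can be scored as plausible at once."""
    h = context_vector(context)
    return {w: 1 / (1 + math.exp(-dot(emb_out[w], h))) for w in vocab}

soft = cbow_scores(["the", "sat"])   # probabilities sum to 1
multi = multilabel_scores(["the", "sat"])  # each in (0, 1), independent
```

The key design difference is in the normalization: the softmax couples all vocabulary scores, whereas the sigmoid decouples them, which is what lets the multi-label framing treat several feature-derived targets as simultaneously correct.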

