
Tweaks and Tricks for Word Embedding Disruptions



Abstract

Word embeddings are established as highly effective models used in several NLP applications. While they differ in architecture and training process, they often exhibit similar properties and remain vector space models with continuously valued dimensions describing the observed data. The complexity resides in the strategies developed for learning the values within each dimension. In this paper, we introduce the concept of disruption, which we define as a side effect of the training process of embedding models. Disruptions are viewed as a set of embedding values that are more likely to be noise than effective descriptive features. We show that handling the disruption phenomenon is of great benefit to bottom-up sentence embedding representations. By contrasting several in-domain and pre-trained embedding models, we propose two simple but very effective tweaking techniques that yield strong empirical improvements on textual similarity tasks.
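To make the idea concrete, here is a minimal sketch of a bottom-up sentence embedding (averaging word vectors) combined with a disruption-damping tweak. The clipping rule, the threshold, and the function names are illustrative assumptions, not the paper's actual techniques:

```python
# Hypothetical sketch: bottom-up sentence embedding with a clipping
# "tweak" that damps disruption-like outlier values before averaging.
# The clipping heuristic and threshold are assumptions for illustration.

def clip_disruptions(vec, threshold=1.0):
    """Clamp embedding values whose magnitude exceeds `threshold`,
    treating extreme values as probable noise rather than features."""
    return [max(-threshold, min(threshold, v)) for v in vec]

def sentence_embedding(word_vectors, threshold=1.0):
    """Bottom-up sentence representation: average the word vectors
    after damping outlier dimensions in each one."""
    clipped = [clip_disruptions(v, threshold) for v in word_vectors]
    dim = len(clipped[0])
    return [sum(v[i] for v in clipped) / len(clipped) for i in range(dim)]

# Toy word vectors; the 5.0 entry plays the role of a disruption that
# would otherwise dominate the averaged sentence vector.
words = [[0.2, 0.1], [0.4, 5.0], [0.0, -0.1]]
print(sentence_embedding(words, threshold=1.0))
```

Without the clipping step, the single outlier value 5.0 would dominate the second dimension of the sentence vector; after damping, the average reflects the remaining values more faithfully.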


