
Obtaining Better Static Word Embeddings Using Contextual Embedding Models



Abstract

The advent of contextual word embeddings (representations of words that incorporate semantic and syntactic information from their context) has led to tremendous improvements on a wide variety of NLP tasks. However, recent contextual models have prohibitively high computational cost in many use cases and are often hard to interpret. In this work, we demonstrate that our proposed distillation method, a simple extension of CBOW-based training, significantly improves the computational efficiency of NLP applications while outperforming existing static embeddings, both those trained from scratch and those distilled by previously proposed methods. As a side effect, our approach also allows a fair comparison of contextual and static embeddings via standard lexical evaluation tasks.
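To illustrate the general idea of distilling static embeddings from a contextual model (not the paper's specific CBOW-style method), a common baseline is to average the contextual vectors of each word type over all of its occurrences in a corpus. The sketch below assumes the per-occurrence contextual vectors have already been computed; `distill_static_embeddings` is a hypothetical helper name.

```python
import numpy as np
from collections import defaultdict

def distill_static_embeddings(corpus_vectors):
    """Given (word, contextual_vector) pairs for every token occurrence
    in a corpus, return one static vector per word type by mean-pooling
    its contextual vectors. This is a generic distillation baseline,
    not the exact method proposed in the paper."""
    sums = {}
    counts = defaultdict(int)
    for word, vec in corpus_vectors:
        if word not in sums:
            sums[word] = np.zeros_like(vec, dtype=float)
        sums[word] += vec
        counts[word] += 1
    return {word: sums[word] / counts[word] for word in sums}

# Toy example: two occurrences of "bank" with different contexts
# collapse into a single static vector.
occurrences = [
    ("bank", np.array([1.0, 0.0])),   # e.g., financial context
    ("bank", np.array([0.0, 1.0])),   # e.g., river context
    ("river", np.array([2.0, 2.0])),
]
static = distill_static_embeddings(occurrences)
```

Once distilled, such static vectors can be evaluated on standard lexical tasks (word similarity, analogy) alongside embeddings trained from scratch, which is the "fair comparison" the abstract refers to.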
