Generalizing and Improving Bilingual Word Embedding Mappings with a Multi-Step Framework of Linear Transformations

Abstract

Using a dictionary to map independently trained word embeddings to a shared space has been shown to be an effective approach to learning bilingual word embeddings. In this work, we propose a multi-step framework of linear transformations that generalizes a substantial body of previous work. The core step of the framework is an orthogonal transformation, and existing methods can be explained in terms of the additional normalization, whitening, re-weighting, de-whitening and dimensionality reduction steps. This allows us to gain new insights into the behavior of existing methods, including the effectiveness of inverse regression, and to design a novel variant that obtains the best published results in zero-shot bilingual lexicon extraction. The corresponding software is released as an open source project.
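As a rough illustration of three of the steps named in the abstract (normalization, whitening, and the core orthogonal transformation), the following is a minimal NumPy sketch. It is not the authors' released software. The function names, and the assumption that rows of `x` and `z` are dictionary-aligned source and target word vectors, are hypothetical choices made for this example. The orthogonal step is written as the standard orthogonal Procrustes solution via SVD; re-weighting, de-whitening and dimensionality reduction are left out.

```python
import numpy as np


def length_normalize(m):
    """Scale every row (one word vector per row) to unit Euclidean length."""
    norms = np.linalg.norm(m, axis=1, keepdims=True)
    return m / np.maximum(norms, 1e-12)  # guard against all-zero rows


def whitening_matrix(m):
    """Return W = (m^T m)^(-1/2), so that (m @ W)^T (m @ W) is the identity.

    Assumes m has full column rank (more rows than dimensions).
    """
    _, s, vt = np.linalg.svd(m, full_matrices=False)
    return vt.T @ np.diag(1.0 / s) @ vt


def orthogonal_maps(x, z):
    """Orthogonal Procrustes step for dictionary-aligned rows of x and z.

    With x^T z = U S V^T, the maps wx = U and wz = V are both orthogonal,
    and x @ wx, z @ wz live in a shared space. The singular values are
    returned as well, since a re-weighting step could make use of them.
    """
    u, s, vt = np.linalg.svd(x.T @ z)
    return u, vt.T, s
```

One way to try it out is on synthetic data where the target embeddings are an exact rotation of the source embeddings, for example `z = x @ q` with `q` taken from `np.linalg.qr` of a random square matrix. In that case `x @ wx` and `z @ wz` should coincide up to floating-point error.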


Bibliographic details

Source
AAAI Conference on Artificial Intelligence; Innovative Applications of Artificial Intelligence Conference; Symposium on Educational Advances in Artificial Intelligence | 2018 | pp. 4629-5745 | 8 pages
Authors
Mikel Artetxe; Gorka Labaka; Eneko Agirre
Original format: PDF
Chinese Library Classification: TP18-53

Similar literature


1. Improving biomedical word representation with locally linear embedding [J]. Zhao Di, Wang Jian, Chu Yonghe. Neurocomputing. 2021, Aug. 4 issue
2. Bilingual embeddings with random walks over multilingual wordnets [J]. Goikoetxea Josu, Soroa Aitor, Agirre Eneko. Knowledge-Based Systems. 2018, Jun. 15 issue
3. Graph-Based Bilingual Word Embedding for Statistical Machine Translation [J]. Wang Rui, Zhao Hai, Ploux Sabine. ACM Transactions on Asian Language Information Processing. 2018, no. 4
4. Generalizing and Improving Bilingual Word Embedding Mappings with a Multi-Step Framework of Linear Transformations [C]. Mikel Artetxe, Gorka Labaka, Eneko Agirre. AAAI Conference on Artificial Intelligence; Innovative Applications of Artificial Intelligence Conference; Symposium on Educational Advances in Artificial Intelligence. 2018
5. Improved GloVe Word Embedding Using Linear Weighting Scheme for Word Similarity Tasks [D]. Lu, Qinglan. 2021
6. Learning linear transformations between counting-based and prediction-based word embeddings [O]. Danushka Bollegala, Kohei Hayashi, Ken-ichi Kawarabayashi. 2011
7. Learning principled bilingual mappings of word embeddings while preserving monolingual invariance [O]. Mikel Artetxe, Gorka Labaka, Eneko Agirre. 2016
