Conference on Empirical Methods in Natural Language Processing

Improving Cross-Lingual Word Embeddings by Meeting in the Middle



Abstract

Cross-lingual word embeddings are becoming increasingly important in multilingual NLP. Recently, it has been shown that these embeddings can be effectively learned by aligning two disjoint monolingual vector spaces through linear transformations, using no more than a small bilingual dictionary as supervision. In this work, we propose to apply an additional transformation after the initial alignment step, which moves cross-lingual synonyms towards a middle point between them. By applying this transformation, our aim is to obtain a better cross-lingual integration of the vector spaces. In addition, and perhaps surprisingly, the monolingual spaces also benefit from this transformation. This is in contrast to the original alignment, which is typically learned such that the structure of the monolingual spaces is preserved. Our experiments confirm that the resulting cross-lingual embeddings outperform state-of-the-art models in both monolingual and cross-lingual evaluation tasks.
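The two-step procedure the abstract describes — an orthogonal (Procrustes-style) alignment learned from a small bilingual dictionary, followed by moving cross-lingual synonyms toward their midpoint — can be illustrated roughly as follows. This is a simplified sketch under assumptions, not the authors' implementation: the paper learns an additional transformation so the midpoint move generalizes to words outside the dictionary, whereas this sketch only averages the dictionary pairs themselves. All names (`align_and_meet_in_middle`, `pairs`, etc.) are illustrative.

```python
import numpy as np

def align_and_meet_in_middle(X, Y, pairs):
    """Sketch of dictionary-based alignment plus a midpoint step.

    X: source-language embedding matrix, shape (n_src, d)
    Y: target-language embedding matrix, shape (n_tgt, d)
    pairs: (i, j) index pairs from a small bilingual dictionary,
           meaning X[i] and Y[j] are translations of each other.
    """
    src_idx = [i for i, _ in pairs]
    tgt_idx = [j for _, j in pairs]
    A, B = X[src_idx], Y[tgt_idx]

    # Step 1: orthogonal alignment (Procrustes). Solve
    # min_W ||A W - B||_F with W orthogonal: W = U V^T from svd(A^T B).
    # An orthogonal map preserves the monolingual space's structure.
    U, _, Vt = np.linalg.svd(A.T @ B)
    W = U @ Vt
    X_aligned = X @ W

    # Step 2: "meet in the middle" — pull each translation pair
    # toward the average of its two aligned vectors. (The paper
    # instead learns a mapping toward these midpoints that applies
    # to the full vocabulary; here only dictionary words move.)
    mid = (X_aligned[src_idx] + Y[tgt_idx]) / 2.0
    X_new, Y_new = X_aligned.copy(), Y.copy()
    X_new[src_idx] = mid
    Y_new[tgt_idx] = mid
    return X_new, Y_new
```

After the midpoint step, each dictionary pair shares a single vector in the common space, while all other words keep the structure-preserving orthogonal alignment from step 1.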
