Conference on Empirical Methods in Natural Language Processing

Improving Cross-Lingual Word Embeddings by Meeting in the Middle



Abstract

Cross-lingual word embeddings are becoming increasingly important in multilingual NLP. Recently, it has been shown that these embeddings can be effectively learned by aligning two disjoint monolingual vector spaces through linear transformations, using no more than a small bilingual dictionary as supervision. In this work, we propose to apply an additional transformation after the initial alignment step, which moves cross-lingual synonyms towards a middle point between them. By applying this transformation our aim is to obtain a better cross-lingual integration of the vector spaces. In addition, and perhaps surprisingly, the monolingual spaces also improve by this transformation. This is in contrast to the original alignment, which is typically learned such that the structure of the monolingual spaces is preserved. Our experiments confirm that the resulting cross-lingual embeddings outperform state-of-the-art models in both monolingual and cross-lingual evaluation tasks.
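The two-step procedure described in the abstract (an orthogonal alignment of the two monolingual spaces learned from a small bilingual dictionary, followed by moving cross-lingual synonyms toward the midpoint between them) can be sketched roughly as follows. This is an illustrative simplification, not the paper's implementation: the function names are hypothetical, the alignment shown is a standard Procrustes solution, and the averaging is applied only to the dictionary pairs themselves, whereas the paper learns an additional transformation so that words outside the dictionary move as well.

```python
import numpy as np

def procrustes_align(X, Z):
    """Learn an orthogonal map W minimizing ||XW - Z||_F.

    Standard solution: if X^T Z = U S V^T (SVD), then W = U V^T.
    """
    U, _, Vt = np.linalg.svd(X.T @ Z)
    return U @ Vt

def meet_in_the_middle(X, Z, pairs):
    """Align X onto Z, then pull each dictionary pair to its midpoint.

    X, Z : (n_words, dim) monolingual embedding matrices.
    pairs: (n_pairs, 2) integer array; pairs[i] = (src_idx, tgt_idx)
           is one entry of the small bilingual dictionary.
    """
    src, tgt = pairs[:, 0], pairs[:, 1]
    # Step 1: linear alignment supervised by the dictionary only.
    W = procrustes_align(X[src], Z[tgt])
    Xw = X @ W
    Z = Z.copy()
    # Step 2: "meet in the middle" -- replace each aligned pair
    # by the average of the two vectors.
    mid = (Xw[src] + Z[tgt]) / 2.0
    Xw[src] = mid
    Z[tgt] = mid
    return Xw, Z
```

After this step the dictionary pairs coincide exactly in the shared space; the paper's contribution is to generalize this pull-to-average behavior to the full vocabularies, which is also what improves the monolingual spaces.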

