首页> 外文会议>International joint conference on natural language processing;Conference on empirical methods in natural language processing >Supervised and Nonlinear Alignment of Two Embedding Spaces for Dictionary Induction in Low Resourced Languages
【24h】

Supervised and Nonlinear Alignment of Two Embedding Spaces for Dictionary Induction in Low Resourced Languages

机译:低资源语言中两个归纳空间的监督和非线性对齐,用于词典归纳

获取原文

摘要

Enabling cross-lingual NLP tasks by leveraging multilingual word embedding has recently attracted much attention. An important motivation is to support lower resourced languages, however, most efforts focus on demonstrating the effectiveness of the techniques using embeddings derived from similar languages to English with large parallel content. In this study, we present a noise tolerant piecewise linear technique to learn a non-linear mapping between two monolingual word embedding vector spaces. We evaluate our approach on inferring bilingual dictionaries. We show that our technique outperforms the state of the art in lower resourced settings with an average improvement of 3.7% for precision @10 across 14 mostly low resourced languages.
机译:通过利用多语言单词嵌入来启用跨语言NLP任务近来引起了很多关注。一个重要的动机是支持资源较少的语言,但是,大多数努力都集中在证明技术的有效性上,该技术使用的是从相似语言到具有大量并行内容的英语的嵌入。在这项研究中,我们提出了一种耐噪声的分段线性技术,以学习两个单语单词嵌入向量空间之间的非线性映射。我们评估推断双语词典的方法。我们表明,在资源较少的环境中,我们的技术优于最新技术,在14种大多数资源较少的语言中,平均精度(@ 10)平均提高了3.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号