首页> 外文会议>Tenth workshop on building and using comparable corpora 2017 >Acquisition of Translation Lexicons for Historically Unwritten Languages via Bridging Loanwords
【24h】

Acquisition of Translation Lexicons for Historically Unwritten Languages via Bridging Loanwords

机译:通过桥接外来词获得用于历史非书面语言的翻译词典

获取原文
获取原文并翻译 | 示例

摘要

With the advent of informal electronic communications such as social media, colloquial languages that were historically unwritten are being written for the first time in heavily code-switched environments. We present a method for inducing portions of translation lexicons through the use of expert knowledge in these settings where there are approximately zero resources available other than a language informant, potentially not even large amounts of monolingual data. We investigate inducing a Moroccan Darija-English translation lexicon via French loanwords bridging into English and find that a useful lexicon is induced for human-assisted translation and statistical machine translation.
机译:随着诸如社交媒体之类的非正式电子通信的出现,历史上未书写的口语语言首次在大量的代码交换环境中被编写。我们提出了一种通过在这些环境中使用专家知识来诱导翻译词典部分的方法,在这些环境中,除了语言信息提供者之外,几乎还有零资源可用,甚至可能没有大量的单语数据。我们调查了通过桥接到英语的法语借词诱导摩洛哥达里哈语-英语翻译词典的过程,发现为人类辅助翻译和统计机器翻译诱导了有用的词典。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号