Bilingual lexicon extraction for a distant language pair using a small parallel corpus

Abstract

The aim of this thesis proposal is to perform bilingual lexicon extraction in cases where only a small parallel corpus is available and a monolingual corpus is hard to obtain for at least one of the languages. Moreover, the languages are typologically distant and no bilingual seed lexicon is available. We focus on the language pair Spanish-Nahuatl. We propose to work with morpheme-based representations in order to reduce sparseness and to facilitate the task of finding lexical correspondences between a highly agglutinative language and a fusional one. We take contextual information into account, but instead of using a precompiled seed dictionary, we use the distribution and dispersion of the positions of the morphological units as cues to compare the contextual vectors and obtain the translation candidates.
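
The idea of comparing units through positional cues rather than a seed dictionary can be illustrated with a minimal Python sketch. This is not the method of the thesis proposal, whose details the abstract does not give. It assumes that both sides of the parallel corpus are already segmented into morphological units, that sentences are aligned one to one, and that a unit can be summarized by the sentences it occurs in, its mean relative position, and the dispersion (standard deviation) of that position. The function names `position_profile`, `score`, and `translation_candidates`, the scoring formula, and the placeholder units `s1`, `t1`, and so on are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def position_profile(sentences):
    """Summarize each unit by (sentence ids, mean relative position, dispersion)."""
    positions = defaultdict(list)    # unit -> relative positions in [0, 1]
    sentence_ids = defaultdict(set)  # unit -> ids of sentences containing it
    for sid, units in enumerate(sentences):
        for i, unit in enumerate(units):
            rel = i / (len(units) - 1) if len(units) > 1 else 0.0
            positions[unit].append(rel)
            sentence_ids[unit].add(sid)
    return {
        unit: (sentence_ids[unit], mean(pos), pstdev(pos))
        for unit, pos in positions.items()
    }


def score(src, tgt):
    """Assumed score: Dice overlap of aligned sentences, discounted by positional mismatch."""
    src_ids, src_mean, src_disp = src
    tgt_ids, tgt_mean, tgt_disp = tgt
    dice = 2 * len(src_ids & tgt_ids) / (len(src_ids) + len(tgt_ids))
    penalty = (abs(src_mean - tgt_mean) + abs(src_disp - tgt_disp)) / 2
    return dice * (1 - penalty)


def translation_candidates(src_sentences, tgt_sentences, unit, top_n=3):
    """Rank target units as translation candidates for one source unit."""
    src_profiles = position_profile(src_sentences)
    tgt_profiles = position_profile(tgt_sentences)
    ranked = sorted(
        tgt_profiles,
        key=lambda t: score(src_profiles[unit], tgt_profiles[t]),
        reverse=True,
    )
    return ranked[:top_n]


if __name__ == "__main__":
    # Placeholder units only; these are not real Spanish or Nahuatl morphemes.
    src = [["s1", "s2"], ["s1", "s3"], ["s4", "s2"]]
    tgt = [["t1", "t2"], ["t1", "t3"], ["t4", "t2"]]
    print(translation_candidates(src, tgt, "s1"))
```

In this toy setting the best candidate for a source unit is the target unit that appears in the same aligned sentences at similar positions. A realistic system would likely need a positional comparison that tolerates the word-order differences between the two languages.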

Bibliographic details

Source
Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies | 2015 | 7 pages
Author
Ximena Gutierrez-Vasques
Original format: PDF
Chinese Library Classification: Programming, software engineering

Similar literature

1. Automatic Extraction of Bilingual Word Pairs from Parallel Corpora with Various Languages Using Learning for Adjacent Information [J]. Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi. Systems and Computers in Japan. 2006, issue 13

2. Bilingual Lexicography and Corpus Methods. The Example of German-Basque as Language Pair [J]. David Lindemann. Procedia - Social and Behavioral Sciences. 2013, issue 2

3. Automatic extraction of bilingual word pairs using inductive chain learning in various languages [J]. Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi. Information Processing & Management. 2006, issue 5

4. Bilingual lexicon extraction for a distant language pair using a small parallel corpus [C]. Ximena Gutierrez-Vasques. Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2015

5. Parallel text mapping of web-based bilingual corpus materials [D]. Zhu, Qibo. 2009

6. Evaluating a Pivot-Based Approach for Bilingual Lexicon Extraction [O]. Jae-Hoon Kim, Hong-Seok Kwon, Hyeong-Won Seo. 2015

7. Automatic Extraction of Bilingual Word Pairs from Parallel Corpora with Various Languages Using Learning for Adjacent Information [O]. Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi. 2014

