首页> 外文会议>Workshop on algorithms and resources for modelling of dialects and language varieties >Dialect Translation: Integrating Bayesian Co-segmentation Models with Pivot-based SMT
【24h】

Dialect Translation: Integrating Bayesian Co-segmentation Models with Pivot-based SMT

机译:方言翻译:与基于枢轴的SMT集成贝叶斯共分割模型

获取原文
获取外文期刊封面目录资料

摘要

Recent research on multilingual statistical machine translation (SMT) focuses on the usage of pivot languages in order to overcome resource limitations for certain language pairs. This paper proposes a new method to translate a dialect language into a foreign language by integrating transliteration approaches based on Bayesian co-segmentation (BCS) models with pivot-based SMT approaches. The advantages of the proposed method with respect to standard SMT approaches are three fold: (1) it uses a standard language as the pivot language and acquires knowledge about the relation between dialects and the standard language automatically, (2) it reduces the translation task complexity by using monotone decoding techniques, (3) it reduces the number of features in the log-linear model that have to be estimated from bilingual data. Experimental results translating four Japanese dialects (Kumamoto, Kyoto, Okinawa, Osaka) into four Indo-European languages (English, German, Russian, Hindi) and two Asian languages (Chinese, Korean) revealed that the proposed method improves the translation quality of dialect translation tasks and outperforms standard pivot translation approaches concatenating SMT engines for the majority of the investigated language pairs.
机译:最近关于多语种统计机器翻译(SMT)的研究侧重于枢轴语言的使用,以克服某些语言对的资源限制。本文提出方法的基础上贝叶斯共同分割(BCS)与基于支点SMT接近模型通过整合音译方言的语言翻译成外语的新方法。所提出的方法关于标准SMT方法的优点是三倍:(1)它使用标准语言作为枢轴语言,自动获取关于方言和标准语言之间关系的知识,(2)它减少了翻译任务通过使用单调解码技术的复杂性,(3)它减少了必须从双语数据估计的日志线性模型中的特征数。翻译四位日本方言(熊本,京都,冲绳,大阪)实验结果分为四个印欧语言(英语,德语,俄语,印地文)和两个亚洲语言(中国,韩国)透露,该方法提高方言的翻译质量翻译任务和优先级标准枢轴翻译接近Consigated语言对的大部分中的SMT发动机。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号