首页> 外文会议>First workshop on algorithms and resources for modelling of dialects and language varieties 2011 >Dialect Translation: Integrating Bayesian Co-segmentation Models with Pivot-based SMT
【24h】

Dialect Translation: Integrating Bayesian Co-segmentation Models with Pivot-based SMT

机译:方言翻译:将贝叶斯共细分模型与基于数据透视的SMT集成在一起

获取原文
获取原文并翻译 | 示例

摘要

Recent research on multilingual statistical machine translation (SMT) focuses on the usage of pivot languages in order to overcome resource limitations for certain language pairs. This paper proposes a new method to translate a dialect language into a foreign language by integrating transliteration approaches based on Bayesian co-segmentation (BCS) models with pivot-based SMT approaches. The advantages of the proposed method with respect to standard SMT approaches are three fold: (1) it uses a standard language as the pivot language and acquires knowledge about the relation between dialects and the standard language automatically, (2) it reduces the translation task complexity by using monotone decoding techniques, (3) it reduces the number of features in the log-linear model that have to be estimated from bilingual data. Experimental results translating four Japanese dialects (Kumamoto, Kyoto, Okinawa, Osaka) into four Indo-European languages (English, German, Russian, Hindi) and two Asian languages (Chinese, Korean) revealed that the proposed method improves the translation quality of dialect translation tasks and outperforms standard pivot translation approaches concatenating SMT engines for the majority of the investigated language pairs.
机译:最近对多语言统计机器翻译(SMT)的研究集中在枢轴语言的使用上,以克服某些语言对的资源限制。本文提出了一种新方法,将基于贝叶斯协同分段(BCS)模型的音译方法与基于枢轴的SMT方法相集成,从而将方言语言翻译为外语。相对于标准SMT方法,该方法的优点有三方面:(1)它以标准语言为中心语言,并自动获得有关方言和标准语言之间关系的知识,(2)减少了翻译任务通过使用单调解码技术来解决复杂性问题,(3)它减少了必须从双语数据中估计的对数线性模型中的特征数量。实验结果将四种日语(熊本,京都,冲绳,大阪)翻译成四种印欧语(英语,德语,俄语,印地语)和两种亚洲语言(中文,韩语),表明该方法提高了方言的翻译质量转换任务并胜过将大多数SMT引擎连接在一起的标准枢轴转换方法,从而适用于大多数调查的语言对。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号