首页> 外文期刊>Journal of Information Recording >Unsupervised Weighting of Transfer Rules in Rule-Based Machine Translation using Maximum-Entropy Approach
【24h】

Unsupervised Weighting of Transfer Rules in Rule-Based Machine Translation using Maximum-Entropy Approach

机译:使用最大熵方法在基于规则的机器翻译中传输规则的无监督权重

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper we present an unsupervised method for learning a model to distinguish between ambiguous selection of structural transfer rules in a rule-based machine translation (MT) system. In rule-based MT systems, transfer rules are the component responsible for converting source language morphological and syntactic structures to target language structures. These transfer rules function by matching a source language pattern of lexical items and applying a sequence of actions. There can, however, be more than one potential sequence of actions for each source language pattern. Our model consists of a set of maximum entropy (or logistic regression) classifiers, one trained for each source language pattern, which select the highest probability sequence of rules for a given sequence of patterns. We perform experiments on the Kazakh - Turkish language pair - a low-resource pair of morphologically-rich languages - and compare our model to two reference MT systems, a rule-based system where transfer rules are applied in a left-to-right longest match manner and to a state-of-the-art system based on the neural encoder-decoder architecture. Our system outforms both of these reference systems in three widely used metrics for machine translation evaluation.
机译:在本文中,我们提出了一种无监督的方法,用于学习模型,区分基于规则的机器翻译(MT)系统中的模糊的结构传输规则。在基于规则的MT系统中,传输规则是负责将源语言形态和句法结构转换为目标语言结构的组件。这些传输规则通过匹配词汇项的源语言模式并应用一系列动作来函数。但是,对于每个源语言模式,可以是多于一个潜在的动作序列。我们的模型由一组最大熵(或逻辑回归)分类器组成,每个源语言模式训练,为给定模式序列选择了最高概率序列。我们对哈萨克 - 土耳其语对进行实验 - 一种低资源对丰富的形态学语言 - 并将我们的模型与两个参考MT系统进行比较,这是一种基于规则的系统,其中在左右最长的最长最长的转移规则基于神经编码器解码器架构的匹配方式和最先进的系统。我们的系统在三种广泛使用的机器翻译评估中的三个广泛使用的指标中实现了这两个参考系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号