首页> 外文期刊>Applied Artificial Intelligence >ON THE STATISTICAL ESTIMATION OF STOCHASTIC FINITE-STATE TRANSDUCERS IN MACHINE TRANSLATION
【24h】

ON THE STATISTICAL ESTIMATION OF STOCHASTIC FINITE-STATE TRANSDUCERS IN MACHINE TRANSLATION

机译:机器翻译中随机有限状态传感器的统计估计

获取原文
获取原文并翻译 | 示例

摘要

The inference of finite-state transducers from bilingual training data plays an important role in many natural-language tasks and mainly in machine translation. However, there are only a few techniques to infer such models. One of these techniques is the grammatical inference and alignments for transducer inference (GIATI) technique that has proven to be very adequate for speech translation, text-input machine translation, or computer-assisted translation. GIATI is a heuristic technique that requires segmented training data (i.e., the input sentences and the output sentences must be segmented with the restriction that the input segments and the output segments must be monotone aligned). For the purpose of obtaining segmented training data, pure statistical word-alignment models are used. This technique is revisited in this article. The main goal is to formally derive the complete GIATI technique using classical expectation - maximization statistical estimation procedure. This new approach allows us to avoid a hard dependence on heuristic "external" statistical techniques (statistical alignments and n-grams). A first set of experimental results obtained in a machine-translation task are also reported to initially validate this new version of the inference technique of finite-state transducers.
机译:从双语训练数据推断有限状态换能器在许多自然语言任务中,尤其是在机器翻译中,起着重要的作用。但是,只有很少的技术可以推断出这种模型。这些技术之一是语法推断和换能器对齐(GIATI)技术,已被证明非常适合语音翻译,文本输入机器翻译或计算机辅助翻译。 GIATI是一种启发式技术,需要分段的训练数据(即,必须限制输入句段和输出句段必须单调对齐来对输入语句和输出句段进行分段)。为了获得分段的训练数据,使用了纯统计字对齐模型。本文将再次探讨该技术。主要目标是使用经典的期望值-最大化统计估计程序来正式推导完整的GIATI技术。这种新方法使我们避免了对启发式“外部”统计技术(统计比对和n元语法)的严格依赖。还报道了在机器翻译任务中获得的第一组实验结果,以初步验证这种有限状态换能器推理技术的新版本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号