首页> 外文学位 >Syntax-based Language Models for Statistical Machine Translation.
【24h】

Syntax-based Language Models for Statistical Machine Translation.

机译:用于统计机器翻译的基于语法的语言模型。

获取原文
获取原文并翻译 | 示例

摘要

The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence is preserved, whereas fluency measures whether this meaning is expressed in valid sentences in the target language. While both of these criteria are difficult to meet; fluency is a much more difficult goal. Generally, this likely has something to do with the asymmetrical nature of producing and understanding sentences; although humans are quite robust at inferring the meaning of text even in the presence of lots of noise and error, the rules that govern grammatical utterances are exacting, subtle; and elusive. To produce understandable text, we can rely on this robust processing hardware, but to produce grammatical text, we have to understand how it, works.;This dissertation attempts to improve the fluency of machine translation output by explicitly incorporating models of the target language structure into machine translation systems. It is organized into three parts. First, we propose a framework for decoding that decouples the structures of the sentences of the source and target languages, and evaluate it with existing grammatical models as language models for machine translation. Next, we apply lessons from that task to the learning of grammars more suitable to the demands of the machine translation. We then incorporate these grammars, called Tree Substitution Grammars, into our decoding framework.
机译:机器翻译的目标是开发可产生自然语言句子的人类品质翻译的算法。机器翻译质量的评估大致分为两个方面:充分性和流畅性。适当性衡量的是忠实地保留原始句子的含义,而流畅性衡量的是该含义是否以目标语言在有效句子中表达。虽然这两个标准都难以满足;流利是一个困难得多的目标。通常,这可能与产生和理解句子的不对称性有关。尽管即使在存在很多杂音和错误的情况下,人类在推断文本含义方面也非常有力,但是控制语法话语的规则却是精确而微妙的。和难以捉摸。要生成可理解的文本,我们可以依靠这种强大的处理硬件,但要生成语法文本,我们必须了解其工作原理。本论文试图通过显式合并目标语言结构的模型来提高机器翻译输出的流畅性进入机器翻译系统。它分为三个部分。首先,我们提出了一种解码框架,该框架解耦了源语言和目标语言的句子结构,并使用现有的语法模型(作为机器翻译的语言模型)对其进行了评估。接下来,我们将从该任务中吸取经验教训,以学习更适合机器翻译需求的语法。然后,我们将这些称为树替换语法的语法合并到我们的解码框架中。

著录项

  • 作者

    Post, Matthew John.;

  • 作者单位

    University of Rochester.;

  • 授予单位 University of Rochester.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 134 p.
  • 总页数 134
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号