Forced Derivation Tree based Model Training to Statistical Machine Translation

机译：基于强制派生树的统计机器翻译模型训练

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A forced derivation tree (FDT) of a sentence pair {ƒ, e} denotes a derivation tree that can translate ƒ into its accurate target translation e. In this paper, we present an approach that leverages structured knowledge contained in FDTs to train component models for statistical machine translation (SMT) systems. We first describe how to generate different FDTs for each sentence pair in training corpus, and then present how to infer the optimal FDTs based on their derivation and alignment qualities. As the first step in this line of research, we verify the effectiveness of our approach in a BTG-based phrasal system, and propose four FDT-based component models. Experiments are carried out on large scale English-to-Japanese and Chinese-to-English translation tasks, and significant improvements are reported on both translation quality and alignment quality.

机译：句子对{ƒ，e}的强制派生树（FDT）表示可以将ƒ转换成其准确的目标译文e的派生树。在本文中，我们提出了一种利用FDT中包含的结构化知识来训练统计机器翻译（SMT）系统的组件模型的方法。我们首先描述如何在训练语料库中为每个句子对生成不同的FDT，然后介绍如何根据它们的派生和对齐质量来推断最佳FDT。作为该研究领域的第一步，我们验证了我们的方法在基于BTG的短语系统中的有效性，并提出了四个基于FDT的组件模型。在大规模的英语到日语和中文到英语翻译任务上进行了实验，并且在翻译质量和对齐质量上都取得了显着的进步。

著录项

来源
《Conference on empirical methods in natural language processing;Conference on computational natural language learning》|2012年|445-454|共10页
会议地点
作者
Nan Duan; Mu Li; Ming Zhou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Compositions of Tree-to-Tree Statistical Machine Translation Models [J] . Maletti Andreas International Journal of Foundations of Computer Science . 2018,第5期

机译：树木统计机器翻译模型的组成
2. A neural reordering model based on phrasal dependency tree for statistical machine translation [J] . Farzi Saeed, Faili Heshaam, Kianian Sahar Intelligent data analysis . 2018,第5期

机译：基于短语相关树的神经机器重新排序模型
3. Syntax-Based Chinese-Vietnamese Tree-to-Tree Statistical Machine Translation with Bilingual Features [J] . Gao Shengxiang, Huang Jihao, Xue Mingya, ACM transactions on Asian language information processing . 2019,第4期

机译：基于句法的汉语-越南树到树统计机器翻译与双语功能
4. Forced Derivation Tree based Model Training to Statistical Machine Translation [C] . Nan Duan, Mu Li, Ming Zhou MNLP 2012 . 2012

机译：基于强制推导树的统计机器翻译模型培训
5. Statistical machine translation: Maximum entropy based translation models and search algorithms. [D] . Garcia Varea, Ismael. 2003

机译：统计机器翻译：基于最大熵的翻译模型和搜索算法。
6. Comparison of neuron-based, kernel-based, tree-based and curve-based machine learning models for predicting daily reference evapotranspiration [O] . Lifeng Wu, Junliang Fan 2015

机译：基于神经元，基于核，基于树和基于曲线的机器学习模型的比较，以预测每日参考蒸散量
7. Modeling Letter-to-Phoneme Conversion as a Phrase Based Statistical Machine Translation Problem with Minimum Error Rate Training [O] . Taraka Rama, Anil Kumar Singh, Sudheer Kolachina 2010

机译：以最小的错误率训练将字母到音素转换建模为基于短语的统计机器翻译问题

Forced Derivation Tree based Model Training to Statistical Machine Translation

摘要

著录项

相似文献

相关主题

期刊订阅