Decoder-based Discriminative Training of Phrase Segmentation for Statistical Machine Translation

机译：统计机器翻译的基于解码器的短语细分判别训练

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a new method of training phrase segmentation model for phrase-based statistical machine translation(SMT). We define a good segmentation as the segmentation producing a good translation. According to this definition, we propose a method that can discriminate between a good segmentation and a bad segmentation based on the translation quality. The proposed approach constructs the phrase labeled data by using the SMT decoder, so that the phrase segmentations supporting good translations can be acquired. Furthermore, our iterative training algorithm of the segmentation model can gradually improve the performance of the SMT decoder. Experimental results show that the proposed method is effective in improving the translation quality of the phrase-based SMT system.

机译：本文提出了一种基于短语的统计机器翻译（SMT）训练短语分割模型的新方法。我们将良好的细分定义为产生良好翻译的细分。根据此定义，我们提出了一种基于翻译质量可以区分好的分割和不好的分割的方法。所提出的方法通过使用SMT解码器构造短语标记的数据，从而可以获取支持良好翻译的短语分割。此外，我们的分割模型的迭代训练算法可以逐渐提高SMT解码器的性能。实验结果表明，该方法对提高基于短语的SMT系统的翻译质量是有效的。

著录项

来源
《International conference on computational linguistics》|2012年|611-619|共9页
会议地点
作者
Hyoung - Gyu Lee; Hae - Chang Rim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
phrase-based SMT; phrase segmentation model; decoder-based approach;

机译：基于短语的SMT;短语分割模型基于解码器的方法;

相似文献

外文文献
中文文献
专利

1. Bayesian Word Alignment and Phrase Table Training for Statistical Machine Translation [J] . Zezhong LI, Hideto IKEDA, Junichi FUKUMOTO IEICE transactions on information and systems . 2013,第7期

机译：统计机器翻译的贝叶斯单词对齐和短语表训练
2. Bayesian Word Alignment and Phrase Table Training for Statistical Machine Translation [J] . Zezhong Li, Hideto Ikeda, Junichi Fukumoto IEICE Transactions on Information and Systems . 2013,第7期

机译：统计机器翻译的贝叶斯单词对齐和短语表训练
3. Integrating Rules and Dictionaries from Shallow-Transfer Machine Translation into Phrase-Based Statistical Machine Translation [J] . P#233, rez-Ortiz Juan Antonio, S#225, The Journal of Artificial Intelligence Research . 2016,第12期

机译：将规则和词典从浅传输机器翻译集成到基于短语的统计机器翻译
4. Decoder-based Discriminative Training of Phrase Segmentation for Statistical Machine Translation [C] . Hyoung - Gyu Lee, Hae - Chang Rim International conference on computational linguistics . 2012

机译：基于解码器的统计机器翻译短语分段判别培训
5. Discriminative training and variational decoding in machine translation via novel algorithms for weighted hypergraphs. [D] . Li, Zhifei. 2010

机译：通过新颖的加权超图算法，在机器翻译中进行判别训练和变异解码。
6. SNPs rs11240569 rs708727 and rs823156 in SLC41A1 Do Not Discriminate Between Slovak Patients with Idiopathic Parkinson’s Disease and Healthy Controls: Statistics and Machine-Learning Evidence [O] . Michal Cibulka, Maria Brodnanova, Marian Grendar, 2019

机译：SLC41A1中的SNP rs11240569rs708727和rs823156不能区分斯洛伐克患有特发性帕金森氏病的患者和健康对照者：统计学和机器学习证据
7. An Iteratively-Trained Segmentation-Free Phrase Translation Model for Statistical Machine Translation [O] . Robert C. Moore, Chris Quirk 2009

机译：统计机器翻译的迭代训练的无分段短语翻译模型

Decoder-based Discriminative Training of Phrase Segmentation for Statistical Machine Translation

摘要

著录项

相似文献

相关主题

期刊订阅