Using on-line available sources of bilingual information for word-level machine translation quality estimation

机译：使用在线可用的双语信息来源，用于字级机器翻译质量估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper explores the use of external sources of bilingual information available on-line for word-level machine translation quality estimation (MTQE). These sources of bilingual information are used as a black box to spot sub-segment correspondences between a source-language (SL) sentence S to be translated and a given translation hypothesis T in the target-language (TL). This is done by segmenting both S and T into overlapping sub-segments of variable length and translating them into the TL and the SL, respectively, using the available bilingual sources of information on the fly. A collection of features is then obtained from the resulting sub-segment translations, which is used by a binary classifier to determine which target words in T need to be post-edited. Experiments are conducted based on the data sets published for the word-level MTQE task in the 2014 edition of the Workshop on Statistical Machine Translation (WMT 2014). The sources of bilingual information used are: machine translation (Apertium and Google Translate) and the bilingual concordancer Reverso Context. The results obtained confirm that, using less information and fewer features, our approach obtains results comparable to those of state-of-the-art approaches, and even outperform them in some data sets.

机译：本文探讨了使用的上线为字级机器翻译的质量估计（MTQE）提供的双语外部信息源。双语这些信息源被用作黑盒子至斑点源语言之间的子段的对应（SL）句子S被翻译和在目标语言（TL）一个给定的翻译假设吨。这是由两个分段S和T成可变长度的重叠的子段，并将它们翻译成TL和SL，分别使用的在飞行信息的可用双语源完成的。的特征的集合，然后从所得的子片段的翻译，其中使用由二元分类器以确定T中需要目标字进行后编辑，其获得的。实验是基于出版了2014年版的研讨会统计机器翻译（WMT 2014）的字级MTQE任务的数据集进行。双语信息使用的来源是：机器翻译（Apertium和谷歌翻译）和双语concordancer Reverso翻转语境。所获得的结果证实，使用更少的信息和较少的功能，我们的方法取得的结果与那些状态的最先进的方法，甚至优于它们在某些数据集。

著录项

来源
《Annual conference of European Association for Machine Translation》|2015年||共8页
会议地点
作者
Miquel Espla-Gomis; Felipe Sanchez-Martinez; Mikel L. Forcada;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Predicting insertion positions in word-level machine translation quality estimation [J] . Espla-Gomis Miquel, Sanchez-Martinez Felipe, Forcada Mikel L. Applied Soft Computing . 2019,第期

机译：预测词级机器翻译质量估计中的插入位置
2. Word-level confidence estimation for machine translation [J] . Ueffing N, Ney H Computational linguistics . 2007,第1期

机译：机器翻译的词级置信度估计
3. Word-Level Confidence Estimation for Machine Translation [J] . Nicola Ueffing, Hermann Ney Computational linguistics . 2007,第1期

机译：机器翻译的词级置信度估计
4. Using on-line available sources of bilingual information for word-level machine translation quality estimation [C] . Miquel Espla-Gomis, Felipe Sanchez-Martinez, Mikel L. Forcada 18th Annual conference of European Association for Machine Translation . 2015

机译：使用在线双语信息源进行词级机器翻译质量估计
5. Translation technology and translation quality: The use of machine translation and computer-assisted translation and its implications for translation quality control [D] . Sun, Haichen 2005

机译：翻译技术和翻译质量：机器翻译和计算机辅助翻译的使用及其对翻译质量控制的影响
6. Milk Source Identification and Milk Quality Estimation Using an Electronic Nose and Machine Learning Techniques [O] . Fanglin Mu, Yu Gu, Jie Zhang, 2020

机译：使用电子鼻子和机器学习技术乳收源识别和牛奶质量估算
7. Predicting insertion positions in word-level machine translation quality estimation [O] . Miquel Esplà-Gomis, Felipe Sánchez-Martínez, Mikel L. Forcada 2019

机译：预测词级机器翻译质量估计中的插入位置

Using on-line available sources of bilingual information for word-level machine translation quality estimation

摘要

著录项

相似文献

相关主题

期刊订阅