首页> 外文会议>International conference on recent advances in natural language processing;Workshop on human-informed translation and interpreting technology >Comparing a Hand-crafted to an Automatically Generated Feature Set for Deep Learning: Pairwise Translation Evaluation
【24h】

Comparing a Hand-crafted to an Automatically Generated Feature Set for Deep Learning: Pairwise Translation Evaluation

机译:手工与自动生成的功能集进行深度学习的比较:成对翻译评估

获取原文

摘要

The automatic evaluation of machine translation (MT) has proven to be a very significant research topic. Most automatic evaluation methods focus on the evaluation of the output of MT as they compute similarity scores that represent translation quality. This work targets on the performance of MT evaluation. We present a general scheme for learning to classify parallel translations, using linguistic information, of two MT model outputs and one human (reference) translation. We present three experiments to this scheme using neural networks (NN). One using string based hand-crafted features (Expl), the second using automatically trained embeddings from the reference and the two MT outputs (one from a statistical machine translation (SMT) model and the other from a neural machine translation (NMT) model), which are learned using NN (Exp2), and the third experiment (Exp3) that combines information from the other two experiments. The languages involved are English (EN), Greek (GR) and Italian (IT) segments are educational in domain. The proposed language-independent learning scheme which combines information from the two experiments (experiment 3) achieves higher classification accuracy compared with models using BLEU score information as well as other classification approaches, such as Random Forest (RF) and Support Vector Machine (SVM).
机译:机器翻译(MT)的自动评估已被证明是一个非常重要的研究课题。大多数自动评估方法会着重于对MT输出的评估,因为它们计算的是表示翻译质量的相似性评分。这项工作针对MT评估的性能。我们提出了一种通用的方案,用于学习使用两种语言模型输出和一种人工(参考)翻译的语言信息对并行翻译进行分类。我们使用神经网络(NN)对该方案进行了三个实验。一种使用基于字符串的手工特征(Expl),第二种使用来自参考的自动训练的嵌入以及两个MT输出(一个来自统计机器翻译(SMT)模型,另一个来自神经机器翻译(NMT)模型) ,这是使用NN(Exp2)和结合了其他两个实验的信息的第三个实验(Exp3)来学习的。所涉及的语言是英语(EN),希腊语(GR)和意大利语(IT)领域的教育。与使用BLEU得分信息以及其他分类方法(例如随机森林(RF)和支持向量机(SVM))的模型相比,所提出的结合了来自两个实验(实验3)的信息的独立于语言的学习方案实现了更高的分类准确性。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号