首页> 外文会议>International conference on computational linguistics >Understanding the Performance of Statistical MT Systems: A Linear Regression Framework
【24h】

Understanding the Performance of Statistical MT Systems: A Linear Regression Framework

机译:了解统计MT系统的性能:线性回归框架

获取原文

摘要

We present a framework for the analysis of Machine Translation performance. We use mul-tivariate linear models to determine the impact of a wide range of features on translation performance. Our assumption is that variables that most contribute to predict translation performance are the key to understand the differences between good and bad translations. During training, we learn the regression parameters that better predict translation quality using a wide range of input features based on the translation model and the first-best translation hypotheses. We use a linear regression with regularization. Our results indicate that with regularized linear regression, we can achieve higher levels of correlation between our predicted values and the actual values of the quality metrics. Our analysis shows that the performance for in-domain data is largely dependent on the characteristics of the translation model. On the other hand, out-of domain data can benefit from better reordering strategies.
机译:我们提供了一个用于分析机器翻译性能的框架。我们使用多元线性模型来确定各种功能对翻译性能的影响。我们的假设是,最有助于预测翻译性能的变量是理解好翻译与坏翻译之间差异的关键。在训练过程中,我们学习基于翻译模型和最佳翻译假设的回归参数,这些参数可以使用多种输入特征更好地预测翻译质量。我们使用带有正则化的线性回归。我们的结果表明,通过正则化线性回归,我们可以在质量指标的预测值和实际值之间获得更高的相关性。我们的分析表明,域内数据的性能在很大程度上取决于翻译模型的特征。另一方面,域外数据可以受益于更好的重新排序策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号