首页> 外文会议>International Speech Communication Association >Data Selection and Smoothing in an Open-Source Systemfor the 2008 NIST Machine Translation Evaluation
【24h】

Data Selection and Smoothing in an Open-Source Systemfor the 2008 NIST Machine Translation Evaluation

机译:2008年NIST机器翻译评估的开源系统中的数据选择和平滑

获取原文

摘要

This paper gives a detailed description of a statistical machinetranslation system developed for the 2008 NIST open MT eval-uation. The system is based on the open source toolkit Moseswith extensions for language model rescoring in a second pass.Significant improvements were obtained with data selectionmethods for the language and translation model. An improve-ment of more than 1 point BLEU on the test set was achieved bya continuous space language model which performs the proba-bility estimation with a neural network. The described systemhas achieved a very good ranking in the 2008 NIST open MTevaluation.
机译:本文给出了为2008年NIST开放式MT评估系统开发的统计机械替代系统的详细描述。该系统基于第二种PASS中的语言模型中的开源工具包MOSESwith用于语言模型。使用语言和翻译模型的数据选择方法获得了显着的改进。在测试集上的超过1点BLEU的改进是通过使用神经网络进行Proba-Bility估计的连续空间语言模型实现的。所描述的系统在2008年NIST开放MTEvaluation中实现了一个非常好的排名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号