Data Selection and Smoothing in an Open-Source System for the 2008 NIST Machine Translation Evaluation

Abstract

This paper gives a detailed description of a statistical machine translation system developed for the 2008 NIST open MT evaluation. The system is based on the open source toolkit Moses with extensions for language model rescoring in a second pass. Significant improvements were obtained with data selection methods for the language and translation model. An improvement of more than 1 point BLEU on the test set was achieved by a continuous space language model which performs the probability estimation with a neural network. The described system has achieved a very good ranking in the 2008 NIST open MT evaluation.

Bibliographic record

Source
International Speech Communication Association | 2008 | 4 pages
Authors
Holger Schwenk; Yannick Esteve
Original format: PDF
Chinese Library Classification: TN912.3-532
Keywords
statistical machine translation; continuous space language model; open source; NIST evaluation
