WMT 2012

Leave-One-Out Phrase Model Training for Large-Scale Deployment

Abstract

Training the phrase table by force-aligning (FA) the training data with the reference translation has been shown to improve phrasal translation quality while significantly reducing the phrase table size on medium-sized tasks. We apply this procedure to several large-scale tasks, with the primary goal of reducing model sizes without sacrificing translation quality. To deal with the noise in the automatically crawled parallel training data, we introduce on-demand word deletions, insertions, and backoffs to achieve a successful alignment rate of over 99%. We also add heuristics to avoid any increase in OOV rates. We are able to reduce already heavily pruned baseline phrase tables by more than 50% with little to no degradation in quality, and occasionally a slight improvement, without any increase in OOVs. We further introduce two global scaling factors for re-estimating the phrase table via posterior phrase alignment probabilities, and a modified absolute discounting method that can be applied to fractional counts.
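The abstract does not spell out the modified discounting formula itself. As a rough illustration of the general idea, absolute discounting applied to fractional (posterior-weighted) phrase counts, the following Python sketch shows one way such a re-estimation step could look. The function name, the `discount` value, and the way a single `posterior_scale` factor enters are assumptions made for this example, not the paper's definitions.

```python
from collections import defaultdict

def discounted_phrase_table(frac_counts, discount=0.4, posterior_scale=1.0):
    """Turn fractional (posterior-weighted) phrase counts into
    translation probabilities with absolute discounting.

    frac_counts:     dict mapping (src_phrase, tgt_phrase) to a
                     fractional count accumulated from posterior
                     phrase alignment probabilities
    discount:        fixed amount subtracted from every count
                     (illustrative value, not from the paper)
    posterior_scale: a single global factor applied to all counts,
                     standing in for the paper's global scaling
                     factors (their exact role is assumed here)
    """
    scaled = {pair: posterior_scale * c for pair, c in frac_counts.items()}

    # Marginal count per source phrase, used for normalization.
    totals = defaultdict(float)
    for (src, _tgt), c in scaled.items():
        totals[src] += c

    table = {}
    for (src, tgt), c in scaled.items():
        # max(c - d, 0) handles fractional counts smaller than the
        # discount; the subtracted mass would normally be reserved for
        # a backoff/smoothing distribution, so rows sum to less than 1.
        table[(src, tgt)] = max(c - discount, 0.0) / totals[src]
    return table

# Toy usage: posterior-weighted counts for one source phrase.
counts = {("das Haus", "the house"): 1.7, ("das Haus", "house"): 0.3}
print(discounted_phrase_table(counts))
```

In this toy run the count 0.3 falls below the discount and is floored to zero, which is exactly the situation a discounting method for fractional counts has to handle gracefully.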
