ACM Transactions on Asian Language Information Processing

Discriminative Training for Log-Linear Based SMT: Global or Local Methods



Abstract

In statistical machine translation, standard methods such as MERT tune a single weight vector on a given development set. These methods suffer from two problems caused by the diversity and uneven distribution of source sentences. First, their performance depends heavily on the choice of the development set, which can make testing performance unstable. Second, sentence-level translation quality is not assured, since tuning is performed at the document level rather than at the sentence level. In contrast to standard global training, in which a single weight vector is learned, we propose novel local training methods to address these two problems. We perform training and testing in one step by locally learning a sentence-wise weight vector for each input sentence. Since the time for each tuning step is non-negligible, and learning sentence-wise weights for the entire test set requires many passes of tuning, efficiency poses a great challenge for local training. We propose an efficient two-phase method that puts local training into practice by employing the ultraconservative update. On NIST Chinese-to-English translation tasks with both medium and large scales of training data, our local training methods significantly outperform the standard methods, with maximum improvements of up to 2.0 BLEU points, while their efficiency remains comparable to that of the standard methods.
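
The ultraconservative update mentioned in the abstract belongs to the MIRA family of large-margin online learners. The following is a minimal sketch of one such clipped update for a sentence-wise weight vector, assuming NumPy feature vectors and a single hope/fear translation pair; the function name, parameters, and the clipping constant C are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def ultraconservative_update(w, feat_hope, feat_fear, loss, C=0.01):
    """One clipped MIRA-style update (illustrative sketch, not the paper's
    exact formulation). Moves w just enough that the 'hope' translation
    outscores the 'fear' translation by a margin equal to loss, while
    staying as close as possible to the old weights."""
    delta = feat_hope - feat_fear                # feature-vector difference
    violation = loss - float(np.dot(w, delta))   # margin shortfall under current w
    if violation <= 0.0:
        return w                                 # margin already satisfied: keep w
    norm_sq = float(np.dot(delta, delta))
    if norm_sq == 0.0:
        return w                                 # identical features: nothing to learn
    tau = min(C, violation / norm_sq)            # clipped step keeps update conservative
    return w + tau * delta

# Hypothetical usage: local tuning for one input sentence could start from
# the global weights and apply a few such updates on development sentences
# retrieved as similar to the input.
w_global = np.zeros(8)
feat_hope = np.array([1.0, 0.5, 0.0, 0.2, 0.0, 0.1, 0.0, 0.3])
feat_fear = np.array([0.8, 0.9, 0.1, 0.0, 0.2, 0.0, 0.1, 0.0])
w_local = ultraconservative_update(w_global, feat_hope, feat_fear, loss=0.4)
```

Because each update has a closed form, a handful of them per test sentence is cheap, which is consistent with the abstract's claim that the local methods' efficiency stays comparable to standard tuning.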
