ACM Transactions on Asian and Low-Resource Language Information Processing

Layer-Wise De-Training and Re-Training for ConvS2S Machine Translation


Abstract

The convolutional sequence-to-sequence (ConvS2S) machine translation system is one of the typical neural machine translation (NMT) systems. In our preliminary studies, training the ConvS2S model tends to get stuck in a local optimum. To overcome this behavior, we propose to de-train a trained ConvS2S model in a mild way and then retrain it to find a better solution globally. In particular, the trained parameters of one layer of the NMT network are abandoned by re-initialization while the other layers' parameters are kept, which kicks off re-optimization from a new start point and at the same time keeps that start point not too far from the previous optimum. This procedure is executed layer by layer until all layers of the ConvS2S model have been explored. Experiments show that, compared to various measures for escaping from a local optimum, including initialization with random seeds, adding perturbations to the baseline parameters, and continuing training (con-training) with the baseline models, our method consistently improves ConvS2S translation quality across various language pairs and achieves better performance.
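
To make the layer-by-layer procedure above concrete, here is a minimal PyTorch-style sketch of de-training one layer at a time and retraining. The encoder.layers / decoder.layers attributes and the train_fn / eval_fn helpers are hypothetical stand-ins rather than the paper's actual code, and keeping the best-scoring candidate after each layer is our own assumption; the abstract only states that every layer is explored in turn.

    import copy

    def reinit(layer):
        """De-train one layer: re-initialize every parameterized
        sub-module, abandoning its trained weights."""
        for m in layer.modules():
            if hasattr(m, "reset_parameters"):
                m.reset_parameters()

    def layerwise_detrain_retrain(model, train_fn, eval_fn):
        """Sketch of layer-wise de-training and re-training.
        `train_fn(model)` retrains the model in place and
        `eval_fn(model)` returns a validation score (e.g., BLEU);
        both are hypothetical helpers, as are the
        `encoder.layers` / `decoder.layers` attributes."""
        best = copy.deepcopy(model)
        best_score = eval_fn(best)
        n_layers = len(best.encoder.layers) + len(best.decoder.layers)
        for i in range(n_layers):
            cand = copy.deepcopy(best)
            layers = list(cand.encoder.layers) + list(cand.decoder.layers)
            reinit(layers[i])       # de-train: re-initialize layer i only
            train_fn(cand)          # re-train from the new start point
            score = eval_fn(cand)
            if score > best_score:  # assumption: keep the better solution
                best, best_score = cand, score
        return best

Because only one layer is re-initialized per round while the others keep their trained values, each re-optimization starts from a point that is new yet still close to the previous optimum, which matches the "mild" de-training described in the abstract.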
