First Workshop on Neural Machine Translation, 2017

An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation

Abstract

Training of neural machine translation (NMT) models usually uses mini-batches for efficiency purposes. During the mini-batched training process, it is necessary to pad shorter sentences in a mini-batch to be equal in length to the longest sentence therein for efficient computation. Previous work has noted that sorting the corpus based on the sentence length before making mini-batches reduces the amount of padding and increases the processing speed. However, despite the fact that mini-batch creation is an essential step in NMT training, widely used NMT toolkits implement disparate strategies for doing so, which have not been empirically validated or compared. This work investigates mini-batch creation strategies with experiments over two different datasets. Our results suggest that the choice of a mini-batch creation strategy has a large effect on NMT training and some length-based sorting strategies do not always work well compared with simple shuffling.
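The contrast the abstract draws can be made concrete with a small sketch. Below is a minimal, self-contained Python illustration, not the paper's or any toolkit's implementation, of the two strategy families it compares: simple shuffling versus length-based sorting before batching, along with the padding overhead each incurs on a toy corpus of token-id sequences. All names here (pad_batch, PAD, and so on) are hypothetical.

```python
import random

PAD = 0  # hypothetical padding token id; real token ids in the toy data start at 1

def pad_batch(batch, pad_id=PAD):
    """Pad every sentence in the batch to the length of the longest sentence in it."""
    max_len = max(len(sent) for sent in batch)
    return [sent + [pad_id] * (max_len - len(sent)) for sent in batch]

def shuffle_batches(corpus, batch_size):
    """Shuffling strategy: randomize sentence order, then cut consecutive
    mini-batches. Batches mix long and short sentences, so padding is high."""
    sents = list(corpus)
    random.shuffle(sents)
    return [pad_batch(sents[i:i + batch_size])
            for i in range(0, len(sents), batch_size)]

def length_sorted_batches(corpus, batch_size):
    """Length-based strategy: sort by length so each mini-batch holds sentences
    of similar length (little padding), then shuffle the order of the batches
    themselves so the model does not always see the shortest sentences first."""
    sents = sorted(corpus, key=len)
    batches = [pad_batch(sents[i:i + batch_size])
               for i in range(0, len(sents), batch_size)]
    random.shuffle(batches)
    return batches

def padding_fraction(batches, pad_id=PAD):
    """Fraction of positions in the padded batches that are padding tokens."""
    total = sum(len(sent) for batch in batches for sent in batch)
    pads = sum(tok == pad_id for batch in batches for sent in batch for tok in sent)
    return pads / total

if __name__ == "__main__":
    random.seed(0)
    # Toy "corpus": 1000 sentences of 3..40 token ids (all ids >= 1).
    corpus = [[random.randint(1, 100) for _ in range(random.randint(3, 40))]
              for _ in range(1000)]
    print("shuffle       :", padding_fraction(shuffle_batches(corpus, 64)))
    print("length-sorted :", padding_fraction(length_sorted_batches(corpus, 64)))
```

On the toy data, the length-sorted variant pads far fewer positions, which is the processing-speed argument the abstract cites for sorting; the paper's finding is that this efficiency gain does not automatically make length-based strategies train better than simple shuffling.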
