Home > Foreign Journals > Machine Translation > Optimizing segmentation granularity for neural machine translation

Optimizing segmentation granularity for neural machine translation



Abstract

In neural machine translation (NMT), it has become standard to translate using subword units to allow for an open vocabulary and improve accuracy on infrequent words. Byte-pair encoding (BPE) and its variants are the predominant approach to generating these subwords, as they are unsupervised, resource-free, and empirically effective. However, the granularity of these subword units is a hyperparameter to be tuned for each language and task, using methods such as grid search. Tuning may be done inexhaustively or skipped entirely due to resource constraints, leading to sub-optimal performance. In this paper, we propose a method to automatically tune this parameter using only one training pass. We incrementally introduce new BPE vocabulary online based on the held-out validation loss, beginning with smaller, general subwords and adding larger, more specific units over the course of training. Our method matches the results found with grid search, optimizing segmentation granularity while significantly reducing overall training time. We also show benefits in training efficiency and performance improvements for rare words due to the way embeddings for larger units are incrementally constructed by combining those from smaller units.
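The core idea in the abstract, growing the BPE vocabulary online and initializing each new merged unit's embedding from its component subwords, can be illustrated with a minimal sketch. The function names, the mean-of-components initialization, and the plateau rule below are illustrative assumptions, not the authors' exact algorithm.

```python
import numpy as np

def init_merged_embedding(emb, left, right):
    """Embed a new merged unit by combining its components' embeddings
    (here: their mean), so it starts from a sensible point rather than
    a random initialization."""
    merged = left + right
    emb[merged] = (emb[left] + emb[right]) / 2.0
    return merged

def should_grow_vocab(val_losses, patience=2, tol=1e-3):
    """Grow the vocabulary when held-out validation loss has stopped
    improving by more than `tol` over the last `patience` evaluations."""
    if len(val_losses) <= patience:
        return False
    recent = val_losses[-(patience + 1):]
    return recent[0] - min(recent[1:]) < tol

# Toy usage: two small subwords; a validation-loss plateau triggers a merge.
rng = np.random.default_rng(0)
emb = {"lo": rng.normal(size=4), "w": rng.normal(size=4)}

val_losses = [3.2, 2.1, 2.0999, 2.0995]  # improvement has stalled
if should_grow_vocab(val_losses):
    new_unit = init_merged_embedding(emb, "lo", "w")  # adds "low"
```

Initializing larger units from smaller ones in this way is what the abstract credits for the rare-word gains: the new embedding inherits whatever the model has already learned about its parts.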
