Annual Meeting of the Association for Computational Linguistics

Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates



Abstract

Subword units are an effective way to alleviate the open vocabulary problems in neural machine translation (NMT). While sentences are usually converted into unique subword sequences, subword segmentation is potentially ambiguous and multiple segmentations are possible even with the same vocabulary. The question addressed in this paper is whether it is possible to harness the segmentation ambiguity as noise to improve the robustness of NMT. We present a simple regularization method, subword regularization, which trains the model with multiple subword segmentations probabilistically sampled during training. In addition, for better subword sampling, we propose a new subword segmentation algorithm based on a unigram language model. We experiment with multiple corpora and report consistent improvements, especially in low-resource and out-of-domain settings.
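A minimal sketch of the sampling idea using the SentencePiece library, which provides unigram language model segmentation with on-the-fly sampling of the kind described in the abstract; the corpus path, model prefix, vocabulary size, and sampling parameters below are illustrative assumptions, not values from the paper.

import sentencepiece as spm

# Train a unigram language model subword vocabulary (hypothetical corpus path and size).
spm.SentencePieceTrainer.train(
    input='corpus.txt', model_prefix='unigram',
    vocab_size=8000, model_type='unigram'
)

sp = spm.SentencePieceProcessor(model_file='unigram.model')

# Deterministic segmentation: one unique subword sequence per sentence.
print(sp.encode('New York is a city.', out_type=str))

# Subword regularization: sample a different segmentation on each call.
# nbest_size=-1 samples from all candidate segmentations; alpha controls how
# sharply the sampling distribution concentrates on high-probability segmentations.
for _ in range(3):
    print(sp.encode('New York is a city.', out_type=str,
                    enable_sampling=True, alpha=0.1, nbest_size=-1))

In an NMT training loop, the sampled encoding would be applied to each sentence anew at every epoch (or every batch), so the model repeatedly sees different subword segmentations of the same training sentence.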
