International Conference on Machine Learning

Mixture Models for Diverse Machine Translation: Tricks of the Trade

Abstract

Mixture models trained via EM are among the simplest, most widely used, and best understood latent variable models in the machine learning literature. Surprisingly, these models have hardly been explored in text generation applications such as machine translation. In principle, they provide a latent variable to control generation and produce a diverse set of hypotheses. In practice, however, mixture models are prone to degeneracies: often only one component gets trained, or the latent variable is simply ignored. We find that disabling dropout noise in responsibility computation is critical to successful training. In addition, the design choices of parameterization, prior distribution, hard versus soft EM, and online versus offline assignment can dramatically affect model performance. We develop an evaluation protocol to assess both the quality and the diversity of generations against multiple references, and provide an extensive empirical study of several mixture model variants. Our analysis shows that certain types of mixture models are more robust and offer the best trade-off between translation quality and diversity compared to variational models and diverse decoding approaches.
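To make the recipe concrete, below is a minimal PyTorch sketch of a hard-EM, online-assignment mixture update of the kind the abstract describes: responsibilities are computed with dropout disabled (eval mode), and only the winning component receives a gradient step. The names ToyComponent and hard_em_step, the toy linear components, and the per-batch (rather than per-sentence) assignment are illustrative assumptions, not the paper's implementation.

import torch
import torch.nn as nn

class ToyComponent(nn.Module):
    """Toy stand-in for one seq2seq translation component."""
    def __init__(self, dim=32, vocab=100):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, dim),
            nn.ReLU(),
            nn.Dropout(0.3),      # dropout that must be off during the E-step
            nn.Linear(dim, vocab),
        )

    def forward(self, src):
        return self.net(src)      # (batch, vocab) logits

def hard_em_step(components, optimizer, src, tgt, loss_fn):
    # E-step: compute responsibilities deterministically. The paper reports
    # that leaving dropout noise on here breaks training, so every component
    # is scored in eval() mode.
    with torch.no_grad():
        for c in components:
            c.eval()
        losses = torch.stack([loss_fn(c(src), tgt) for c in components])
        z = int(losses.argmin())  # hard (winner-take-all) assignment

    # M-step: gradient step on the selected component only, dropout back on.
    winner = components[z]
    winner.train()
    optimizer.zero_grad()
    loss = loss_fn(winner(src), tgt)
    loss.backward()
    optimizer.step()
    return z, loss.item()

# Usage on random toy data (hypothetical shapes and hyperparameters).
K, dim, vocab = 3, 32, 100
components = nn.ModuleList(ToyComponent(dim, vocab) for _ in range(K))
optimizer = torch.optim.Adam(components.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
src = torch.randn(8, dim)
tgt = torch.randint(0, vocab, (8,))
z, loss = hard_em_step(components, optimizer, src, tgt, loss_fn)
print(f"batch assigned to component {z}, loss {loss:.3f}")

A soft-EM variant would instead weight every component's loss by its posterior responsibility (a softmax over the negative component losses) rather than taking the argmin; the dropout caveat applies to the responsibility computation in either case.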
