Classical Structured Prediction Losses for Sequence to Sequence Learning

Abstract

There has been much recent work on training neural attention models at the sequence level, using either reinforcement-learning-style methods or by optimizing the beam. In this paper, we survey a range of classical objective functions that have been widely used to train linear models for structured prediction, and apply them to neural sequence-to-sequence models. Our experiments show that these losses can perform surprisingly well, slightly outperforming beam search optimization in a like-for-like setup. We also report new state-of-the-art results on both IWSLT'14 German-English translation and Gigaword abstractive summarization. On the larger WMT'14 English-French task, sequence-level training achieves 41.5 BLEU, which is on par with the state of the art.
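As a rough illustration of one such classical sequence-level objective, the sketch below shows expected risk minimization over an n-best candidate list in PyTorch. This is not the authors' implementation; the function name `expected_risk_loss` and the toy tensors are hypothetical, and in a real setup the scores would come from beam search hypotheses and the costs from a sentence-level metric such as 1 - BLEU.

```python
# Minimal sketch (assumed, not the paper's code) of expected risk
# minimization over a candidate set of output sequences.
import torch
import torch.nn.functional as F

def expected_risk_loss(scores, costs):
    """Expected cost of the model's distribution over candidates.

    scores: (num_candidates,) model log-scores per candidate hypothesis,
            e.g. summed token log-probabilities from beam search.
    costs:  (num_candidates,) task cost per candidate, e.g. 1 - sentence BLEU.
    """
    # Normalize the scores over the candidate set into a distribution.
    probs = F.softmax(scores, dim=0)
    # Expected cost under that distribution; lower is better.
    return torch.sum(probs * costs)

# Toy usage: three candidates with model scores and metric-derived costs.
scores = torch.tensor([2.1, 1.3, 0.4], requires_grad=True)
costs = torch.tensor([0.2, 0.5, 0.9])
loss = expected_risk_loss(scores, costs)
loss.backward()  # gradients shift probability mass toward low-cost candidates
```

Minimizing this expectation pushes probability mass toward candidates with low cost, which is the sense in which such losses train the model at the sequence level rather than token by token.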