Conference on Empirical Methods in Natural Language Processing

Length bias in Encoder Decoder Models and a Case for Global Conditioning



Abstract

Encoder-decoder networks are popular for modeling sequences probabilistically in many applications. These models use the power of the Long Short-Term Memory (LSTM) architecture to capture the full dependence among variables, unlike earlier models like CRFs that typically assumed conditional independence among non-adjacent variables. However, in practice, encoder-decoder models exhibit a bias towards short sequences that, surprisingly, gets worse with increasing beam size. In this paper we show that this phenomenon is due to a discrepancy between the full-sequence margin and the per-element margin enforced by the locally conditioned training objective of an encoder-decoder model. The discrepancy impacts long sequences more adversely, which explains the bias towards predicting short sequences. For the case where the predicted sequences come from a closed set, we show that a globally conditioned model alleviates the above problems of encoder-decoder models. From a practical point of view, our proposed model also eliminates the need for beam search during inference, reducing it to an efficient dot-product-based search in a vector space.
