
Length bias in Encoder Decoder Models and a Case for Global Conditioning



Abstract

Encoder-decoder networks are popular for probabilistic sequence modeling in many applications. These models use the power of the Long Short-Term Memory (LSTM) architecture to capture the full dependence among variables, unlike earlier models such as CRFs, which typically assumed conditional independence among non-adjacent variables. In practice, however, encoder-decoder models exhibit a bias towards short sequences that, surprisingly, gets worse with increasing beam size. In this paper we show that this phenomenon is due to a discrepancy between the full-sequence margin and the per-element margin enforced by the locally conditioned training objective of an encoder-decoder model. The discrepancy impacts long sequences more adversely, explaining the bias towards predicting short sequences. For the case where the predicted sequences come from a closed set, we show that a globally conditioned model alleviates the above problems of encoder-decoder models. From a practical point of view, our proposed model also eliminates the need for beam search during inference, which reduces to an efficient dot-product-based search in a vector space.
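A minimal numeric sketch (not from the paper) of the length bias the abstract describes: under a locally normalized model, a sequence's score is the sum of per-step log-probabilities, so every extra token can only lower it, and a short, merely adequate candidate can outscore a long, uniformly confident one.

```python
# Toy illustration (assumed numbers, not the paper's data) of why locally
# normalized scoring favors short sequences.
import math

short_seq = [0.6, 0.9]                       # 2 tokens, each step fairly confident
long_seq = [0.9, 0.9, 0.9, 0.9, 0.9, 0.9]    # 6 tokens, every step very confident

score_short = sum(math.log(p) for p in short_seq)  # ~ -0.616
score_long = sum(math.log(p) for p in long_seq)    # ~ -0.632

# The short sequence wins even though the long one is more confident at
# every single step; a wider beam keeps more such short candidates alive,
# consistent with the bias worsening as beam size grows.
print(score_short > score_long)  # True
```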
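The final claim, that inference over a closed output set reduces to a dot-product search, can be sketched as below. The encoder, embedding dimension, and candidate set are hypothetical placeholders rather than the paper's actual architecture; the point is only that once every candidate sequence has a fixed embedding, beam search is replaced by one matrix-vector product and an argmax.

```python
# Hedged sketch of dot-product inference over a closed candidate set.
import numpy as np

rng = np.random.default_rng(0)
d = 128                   # shared embedding dimension (assumed)
num_candidates = 10_000   # size of the closed candidate set (assumed)

# Precomputed embeddings of every candidate output sequence (placeholder data).
candidate_embeddings = rng.standard_normal((num_candidates, d))

def encode_input(x_placeholder):
    """Stand-in for the LSTM encoder mapping an input to a d-dim vector."""
    return rng.standard_normal(d)

query = encode_input("source sequence")
scores = candidate_embeddings @ query   # one dot product per candidate
best = int(np.argmax(scores))           # exact search, no beam needed
print(best, scores[best])
```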
