Annual Meeting of the Association for Computational Linguistics

Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms



Abstract

Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWEMs), consisting of parameter-free pooling operations, relative to word-embedding-based RNN/CNN models. Surprisingly, SWEMs exhibit comparable or even superior performance in the majority of cases considered. Based upon this understanding, we propose two additional pooling strategies over learned word embeddings: (i) a max-pooling operation for improved interpretability; and (ii) a hierarchical pooling operation, which preserves spatial (n-gram) information within text sequences. We present experiments on 17 datasets encompassing three tasks: (i) (long) document classification; (ii) text sequence matching; and (iii) short text tasks, including classification and tagging.
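The parameter-free pooling operations the abstract refers to can be sketched in a few lines. The snippet below is an illustrative NumPy sketch (not the authors' released code): average pooling, max pooling, and hierarchical pooling that averages within each n-gram window and then max-pools across windows; the function names and the toy embedding matrix are assumptions for illustration.

```python
import numpy as np

def swem_avg(emb):
    # Average pooling over the sequence axis: (L, d) -> (d,)
    return emb.mean(axis=0)

def swem_max(emb):
    # Max pooling: each embedding dimension keeps its largest
    # value across all words, which aids interpretability
    return emb.max(axis=0)

def swem_hier(emb, n=3):
    # Hierarchical pooling: average within each n-gram window,
    # then max-pool over windows to retain local word order
    L, _ = emb.shape
    windows = np.stack([emb[i:i + n].mean(axis=0)
                        for i in range(L - n + 1)])
    return windows.max(axis=0)

# Toy input: a "sentence" of 5 words with 4-dimensional embeddings
emb = np.random.randn(5, 4)
avg, mx, hier = swem_avg(emb), swem_max(emb), swem_hier(emb)
```

All three operators map a variable-length sequence of word vectors to a single fixed-size vector with no trainable parameters, which is what makes SWEMs so much cheaper than RNN/CNN encoders.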
