Conference on Empirical Methods in Natural Language Processing (EMNLP)

Learning Universal Sentence Representations with Mean-Max Attention Autoencoder

Abstract

In order to learn universal sentence representations, previous methods focus on complex recurrent neural networks or supervised learning. In this paper, we propose a mean-max attention autoencoder (mean-max AAE) within the encoder-decoder framework. Our autoencoder relies entirely on the multi-head self-attention mechanism to reconstruct the input sequence. In the encoding stage, we propose a mean-max strategy that applies both mean and max pooling operations over the hidden vectors to capture diverse information from the input. To let this information steer the reconstruction process dynamically, the decoder performs attention over the mean-max representation. By training our model on a large collection of unlabelled data, we obtain high-quality sentence representations. Experimental results on a broad range of 10 transfer tasks demonstrate that our model outperforms state-of-the-art unsupervised single methods, including the classical skip-thoughts model (Kiros et al., 2015) and the advanced skip-thoughts+LN model (Ba et al., 2016). Furthermore, compared with traditional recurrent neural networks, our mean-max AAE greatly reduces training time.
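The abstract's two core ideas, concatenating mean and max pooling into a sentence representation and letting the decoder attend over that representation, can be illustrated with a minimal PyTorch sketch. All names, shapes, and hyperparameters below (e.g. d_model, the use of nn.MultiheadAttention) are illustrative assumptions for exposition, not the authors' released implementation.

```python
# A hedged sketch of the mean-max encoding and decoder-side attention
# described in the abstract. Names and hyperparameters are assumptions.
import torch
import torch.nn as nn

d_model = 512  # assumed hidden size

def mean_max_pool(hidden: torch.Tensor) -> torch.Tensor:
    """Apply mean and max pooling over the time dimension and concatenate.

    hidden: encoder hidden vectors of shape (batch, seq_len, d_model)
    returns: mean-max sentence representation of shape (batch, 2 * d_model)
    """
    mean_vec = hidden.mean(dim=1)        # (batch, d_model)
    max_vec = hidden.max(dim=1).values   # (batch, d_model)
    return torch.cat([mean_vec, max_vec], dim=-1)

# One way the decoder can "perform attention over the mean-max
# representation": treat the two pooled vectors as a length-2 memory,
# so attention weights shift between the mean and max views per step.
attn = nn.MultiheadAttention(embed_dim=d_model, num_heads=8, batch_first=True)

hidden = torch.randn(4, 20, d_model)                     # toy encoder states
memory = torch.stack([hidden.mean(dim=1),
                      hidden.max(dim=1).values], dim=1)  # (4, 2, d_model)
queries = torch.randn(4, 20, d_model)                    # toy decoder states
context, weights = attn(queries, memory, memory)         # (4, 20, d_model)
```

Intuitively, max pooling picks out the most salient feature per dimension while mean pooling preserves overall context; attending over both pooled vectors lets the decoder weigh these two views dynamically at each reconstruction step, as the abstract describes.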