Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention

Degen HUANG; Anil AHMED; Syed Yasser ARAFAT; Khawaja Iftekhar RASHID; Qasim ABBAS; Fuji REN

首页> 外文期刊>IEICE transactions on information and systems >Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention

【24h】

Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention

机译：通过混合双向-LSTM和CNN利用加权汇集注意力的句子嵌入和相似性

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Neural networks have received considerable attention in sentence similarity measuring systems due to their efficiency in dealing with semantic composition. However, existing neural network methods are not sufficiently effective in capturing the most significant semantic information buried in an input. To address this problem, a novel weighted-pooling attention layer is proposed to retain the most remarkable attention vector. It has already been established that long short-term memory and a convolution neural network have a strong ability to accumulate enriched patterns of whole sentence semantic representation. First, a sentence representation is generated by employing a siamese structure based on bidirectional long short-term memory and a convolutional neural network. Subsequently, a weighted-pooling attention layer is applied to obtain an attention vector. Finally, the attention vector pair information is leveraged to calculate the score of sentence similarity. An amalgamation of both, bidirectional long short-term memory and a convolutional neural network has resulted in a model that enhances information extracting and learning capacity. Investigations show that the proposed method outperforms the state-of-the-art approaches to datasets for two tasks, namely semantic relatedness and Microsoft research paraphrase identification. The new model improves the learning capability and also boosts the similarity accuracy as well.

机译：由于它们在处理语义构成的效率，神经网络在句子相似度测量系统中受到了相当大的关注。然而，现有的神经网络方法在捕获在输入中掩埋的最重要的语义信息方面没有充分有效。为了解决这个问题，提出了一种新的加权汇集注意层来保持最显着的注意力矢量。已经确定了长期内记忆和卷积神经网络具有很强的积累整句语义表示的丰富模式的能力很强。首先，通过基于双向长期内记忆和卷积神经网络的暹罗结构来生成句子表示。随后，应用加权汇集注意层以获得注意矢量。最后，利用注意矢量对信息来计算句子相似度的分数。双向长期短期存储器和卷积神经网络的融合导致了一种增强信息提取和学习能力的模型。调查表明，该方法优于两项任务的最先进的方法，即语义相关性和Microsoft研究解释识别。新模型还提高了学习能力，也提高了相似性准确性。

著录项

来源
《IEICE transactions on information and systems》 |2020年第10期|共12页
作者
Degen HUANG; Anil AHMED; Syed Yasser ARAFAT; Khawaja Iftekhar RASHID; Qasim ABBAS; Fuji REN;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
sentence similaritysentence embeddingdeep learninglong short-term memoryconvolutional neural network;

机译：句子相似度嵌入嵌入式Learninglong短期记忆volueNooldal网络;

相似文献

外文文献
中文文献
专利

1. Attention Based Multi-Patched 3D-CNNs with Hybrid Fusion Architecture for Reducing False Positives during Lung Nodule Detection [J] . Vamsi Krishna Vipparla, Premith Kumar Chilukuri, Giri Babu Kande Journal of Computer and Communications . 2021,第4期

机译：基于关注的多修补3D-CNN，具有混合融合架构，用于减少肺结核检测期间的误报
2. An Attention Mechanism Oriented Hybrid CNN-RNN Deep Learning Architecture of Container Terminal Liner Handling Conditions Prediction [J] . Bin Li, Yuqing He Computational intelligence and neuroscience . 2021,第a期

机译：一种注意机制型混合CNN-RNN集装箱终端衬里处理条件预测的深度学习架构
3. Attention Based Multi-Patched 3D-CNNs with Hybrid Fusion Architecture for Reducing False Positives during Lung Nodule Detection [J] . Vamsi Krishna Vipparla, Premith Kumar Chilukuri, Giri Babu Kande 电脑和通信（英文） . 2021,第004期

机译：基于关注的多修补3D-CNN，具有混合融合架构，用于减少肺结核检测期间的误报
4. Utilization of Residual CNN-GRU With Attention Mechanism for Classification of 12-lead ECG [C] . Petr Nejedly, Adam Ivora, Ivo Viscor, Computing in Cardiology . 2020

机译：利用剩余CNN-GRU对12引导ECG分类的注意机制
5. Computational modeling and utilization of attention, surprise and attention gating. [D] . Mundhenk, Terrell Nathan. 2009

机译：计算模型和注意力，惊奇和注意力门控的利用。
6. An Attention Mechanism Oriented Hybrid CNN-RNN Deep Learning Architecture of Container Terminal Liner Handling Conditions Prediction [O] . Bin Li, Yuqing He 2021

机译：一种注意机制型混合CNN-RNN集装箱终端衬里处理条件预测的深度学习架构
7. Attention Based Multi-Patched 3D-CNNs with Hybrid Fusion Architecture for Reducing False Positives during Lung Nodule Detection [O] . Vamsi Krishna Vipparla, Premith Kumar Chilukuri, Giri Babu Kande 2021

机译：基于关注的多修补3D-CNN，具有混合融合架构，用于减少肺结核检测期间的误报

Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention

摘要

著录项

相似文献

相关主题

期刊订阅