...
首页> 外文期刊>IEICE transactions on information and systems >Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention
【24h】

Sentence-Embedding and Similarity via Hybrid Bidirectional-LSTM and CNN Utilizing Weighted-Pooling Attention

机译:通过混合双向-LSTM和CNN利用加权汇集注意力的句子嵌入和相似性

获取原文
           

摘要

Neural networks have received considerable attention in sentence similarity measuring systems due to their efficiency in dealing with semantic composition. However, existing neural network methods are not sufficiently effective in capturing the most significant semantic information buried in an input. To address this problem, a novel weighted-pooling attention layer is proposed to retain the most remarkable attention vector. It has already been established that long short-term memory and a convolution neural network have a strong ability to accumulate enriched patterns of whole sentence semantic representation. First, a sentence representation is generated by employing a siamese structure based on bidirectional long short-term memory and a convolutional neural network. Subsequently, a weighted-pooling attention layer is applied to obtain an attention vector. Finally, the attention vector pair information is leveraged to calculate the score of sentence similarity. An amalgamation of both, bidirectional long short-term memory and a convolutional neural network has resulted in a model that enhances information extracting and learning capacity. Investigations show that the proposed method outperforms the state-of-the-art approaches to datasets for two tasks, namely semantic relatedness and Microsoft research paraphrase identification. The new model improves the learning capability and also boosts the similarity accuracy as well.
机译:由于它们在处理语义构成的效率,神经网络在句子相似度测量系统中受到了相当大的关注。然而,现有的神经网络方法在捕获在输入中掩埋的最重要的语义信息方面没有充分有效。为了解决这个问题,提出了一种新的加权汇集注意层来保持最显着的注意力矢量。已经确定了长期内记忆和卷积神经网络具有很强的积累整句语义表示的丰富模式的能力很强。首先,通过基于双向长期内记忆和卷积神经网络的暹罗结构来生成句子表示。随后,应用加权汇集注意层以获得注意矢量。最后,利用注意矢量对信息来计算句子相似度的分数。双向长期短期存储器和卷积神经网络的融合导致了一种增强信息提取和学习能力的模型。调查表明,该方法优于两项任务的最先进的方法,即语义相关性和Microsoft研究解释识别。新模型还提高了学习能力,也提高了相似性准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号