ACM Transactions on Asian and Low-Resource Language Information Processing

SACNN: Self-attentive Convolutional Neural Network Model for Natural Language Inference



Abstract

Inference has been a central problem for understanding and reasoning in artificial intelligence. In particular, natural language inference is an interesting problem that has attracted the attention of many researchers. Natural language inference aims to predict whether a hypothesis sentence can be inferred from a premise sentence. Most prior work relies on a simplistic association between the premise and hypothesis sentence pairs, which is not sufficient for learning complex relationships between them. That strategy also fails to fully exploit local context information. Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks are not effective at modeling long-term dependencies, and their schemes are far more complex than Convolutional Neural Networks (CNNs). To address this long-term-dependency problem, and to incorporate context for modeling better sentence representations, this article presents a general Self-Attentive Convolutional Neural Network (SACNN) for natural language inference and sentence-pair modeling tasks. The proposed model uses CNNs to integrate mutual interactions between sentences, so that each sentence's representation is formed with its counterpart taken into consideration. Moreover, the self-attention mechanism helps fully exploit the context semantics and long-term dependencies within a sentence. Experimental results show that SACNN outperforms strong baselines, achieving an accuracy of 89.7% on the Stanford Natural Language Inference (SNLI) dataset.
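The abstract describes the architecture only at a high level: a convolutional layer captures local context within each sentence, a self-attention layer models long-term dependencies, and the premise and hypothesis encodings are combined for classification. The following is a minimal PyTorch sketch of that idea; the layer sizes, the scaled dot-product attention formulation, the pooling step, and the matching/classification head are illustrative assumptions, not the authors' published implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SelfAttentiveCNNEncoder(nn.Module):
    """Hypothetical sentence encoder: CNN for local context + self-attention."""
    def __init__(self, vocab_size, emb_dim=300, n_filters=300, kernel_size=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # 1-D convolution over the token dimension captures local context.
        self.conv = nn.Conv1d(emb_dim, n_filters, kernel_size,
                              padding=kernel_size // 2)
        self.scale = n_filters ** 0.5

    def forward(self, tokens):                       # tokens: (batch, seq_len)
        x = self.embed(tokens)                       # (batch, seq_len, emb_dim)
        h = F.relu(self.conv(x.transpose(1, 2)))     # (batch, n_filters, seq_len)
        h = h.transpose(1, 2)                        # (batch, seq_len, n_filters)
        # Scaled dot-product self-attention: every position attends to the
        # whole sentence, modeling dependencies beyond the CNN's window.
        attn = torch.softmax(h @ h.transpose(1, 2) / self.scale, dim=-1)
        h = attn @ h                                 # (batch, seq_len, n_filters)
        return h.max(dim=1).values                   # max-pool to one vector

class SACNNClassifier(nn.Module):
    """Hypothetical 3-way NLI head over premise/hypothesis encodings."""
    def __init__(self, vocab_size, n_filters=300, n_classes=3):
        super().__init__()
        self.encoder = SelfAttentiveCNNEncoder(vocab_size, n_filters=n_filters)
        # Standard matching features: [p; h; |p - h|; p * h].
        self.mlp = nn.Sequential(nn.Linear(4 * n_filters, 300), nn.ReLU(),
                                 nn.Linear(300, n_classes))

    def forward(self, premise, hypothesis):          # each: (batch, seq_len)
        p = self.encoder(premise)
        h = self.encoder(hypothesis)
        feats = torch.cat([p, h, torch.abs(p - h), p * h], dim=-1)
        return self.mlp(feats)                       # logits over entailment,
                                                     # contradiction, neutral

Concatenating the two sentence vectors with their absolute difference and elementwise product is a common matching scheme in sentence-pair models; note that the abstract says SACNN lets each sentence interact with its counterpart during encoding, which this independent-encoder sketch does not attempt to reproduce.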


