首页> 外文会议>International Joint Conference on Neural Networks >Integrating word embeddings and traditional NLP features to measure textual entailment and semantic relatedness of sentence pairs
【24h】

Integrating word embeddings and traditional NLP features to measure textual entailment and semantic relatedness of sentence pairs

机译:集成词嵌入和传统NLP功能以测量句子对的文本含意度和语义相关性

获取原文

摘要

Recent years the distributed representations of words (i.e., word embeddings) have been shown to be able to significantly improve performance in many natural language processing tasks, such as pos-of-tag tagging, chunking, named entity recognition and sentiment polarity judgement, etc. However, previous tasks only involve a single sentence. In contrast, this paper evaluates the effectiveness of word embeddings in sentence pair classification or regression problems. Specifically, we propose novel simple yet effective features based on word embeddings and extract many traditional linguistic features. Then these features serve as input of a classification/regression algorithm in isolation and in combination. Evaluations are conducted on three sentence pair classification/regression tasks, i.e., textual entailment, cross-lingual textual entailment and semantic relatedness estimation. Experiments on benchmark datasets provided by Semantic Evaluation 2013 and 2014 showed that using word embeddings is able to significantly improve the performance and our results outperform the best achieved results so far.
机译:近年来,单词的分布式表示(即单词嵌入)已被证明能够显着提高许多自然语言处理任务的性能,例如标记后标记,分块,命名实体识别和情感极性判断等。但是,以前的任务仅涉及一个句子。相反,本文评估了词嵌入在句子对分类或回归问题中的有效性。具体来说,我们提出了一种基于词嵌入的新颖简单而有效的特征,并提取了许多传统的语言特征。然后,这些特征可以单独或组合用作分类/回归算法的输入。对三个句子对的分类/回归任务进行评估,即文本范围,跨语言文本范围和语义相关性估计。由语义评估2013和2014提供的基准数据集的实验表明,使用词嵌入可以显着提高性能,并且我们的结果优于迄今为止取得的最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号