...
首页> 外文期刊>Neurocomputing >A classification model for semantic entailment recognition with feature combination
【24h】

A classification model for semantic entailment recognition with feature combination

机译:具有特征组合的语义蕴涵识别的分类模型

获取原文
获取原文并翻译 | 示例
           

摘要

Recent years have witnessed the fast development of multimedia platforms in China, such as Youku, LeTV and Weibo. Images and videos are usually uploaded with textual descriptions, such as titles and introductions of these media. These texts are the key to multimedia content understanding, and this paper is dedicated to multimedia understanding with visual content entailment via recognizing semantic entailment in these texts. In fact, the natural language processing community has been manifesting increasing interest in semantic entailment recognition in English texts. Yet, so far not much attention has been paid to semantic entailment recognition in Chinese texts. Therefore, this paper investigates on multimedia semantic entailment with Chinese texts. Recognizing semantic entailment in Chinese texts can be cast as a classification problem. In this paper, a classification model is constructed based on support vector machine to detect high-level semantic entailment relations in Chinese text pair, including entailment and non-entailment for the Binary-Class and forward entailment, reverse entailment, bidirectional entailment, contradiction and independence for the Multi-Class. We explore different semantic feature combinations based on three kinds of Chinese textual features, including Chinese surface textual, Chinese lexical semantic and Chinese syntactic features, and utilize them to feed our classification model. The experiment results show that the accuracy of our classification model for semantic entailment recognition with the feature combination using all the three kinds of Chinese textual features achieves a much better performance than the baseline in Multi-Class and slightly better results than the baseline in the Binary-Class. (C) 2016 Elsevier B.V. All rights reserved.
机译:近年来,中国的优酷,乐视和微博等多媒体平台迅速发展。图片和视频通常带有文字说明,例如这些媒体的标题和介绍。这些文本是理解多媒体内容的关键,本文致力于通过识别这些文本中的语义蕴涵来实现具有视觉内容蕴涵的多媒体理解。实际上,自然语言处理社区对英语文本中的语义蕴涵识别越来越感兴趣。然而,到目前为止,中文文本中的语义蕴涵识别尚未引起足够的重视。因此,本文研究了中文文本的多媒体语义蕴涵。可以将中文文本中的语义蕴涵识别为分类问题。本文基于支持向量机构建了分类模型,以检测中文文本对中的高级语义包含关系,包括二元类的包含和非包含以及正向包含,反向包含,双向包含,矛盾和多类的独立性。我们基于汉语表面文本,汉语词汇语义和汉语句法特征三种汉语文本特征,探索了不同的语义特征组合,并利用它们来提供分类模型。实验结果表明,我们的分类模型在使用三种中文文本特征进行特征组合时的语义蕴涵识别的准确性要比Multi-Class中的基线好得多,而结果比Binary中的基线要好一些-类。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号