首页> 外文期刊>Knowledge-Based Systems >Integrating statistical and lexical information for recognizing textual entailments in text
【24h】

Integrating statistical and lexical information for recognizing textual entailments in text

机译:整合统计信息和词汇信息以识别文本中的文字含义

获取原文
获取原文并翻译 | 示例
       

摘要

Recognizing textual entailment is to infer that a given text span follows from the meaning of a given hypothesis. To have better recognition capability, it is necessary to employ deep text processing units such as syntactic parsers and semantic taggers. However, these resources are not usually available in other non-English languages. In this paper, we present a light-weight Chinese textual entailment recognition system using part-of-speech information only. We designed two different feature models from training data and employed the well-known kernel method to learn to predict testing data. One feature set abstracts the generic statistics between the text pairs, while the other set directly models lexical features based on the traditional bag-of-words model. The ability of the proposed feature models not only brings additional statistical information from their datasets but also helps to enhance the prediction capability. To validate this, we conducted the experiments on the novel benchmark corpus - NTCIR-RITE-2011. The empirical results demonstrate that our method achieves the best results in comparison to the other competitors. In terms of accuracy, our method achieves 54.77% for the NTC1R RITE MC task.
机译:认识到文字蕴含是要根据给定假设的含义来推断给定的文本跨度。为了具有更好的识别能力,有必要采用深度文本处理单元,例如语法解析器和语义标记器。但是,通常不会以其他非英语语言提供这些资源。在本文中,我们提出了一种仅使用词性信息的轻量级中文文本蕴涵识别系统。我们从训练数据中设计了两个不同的特征模型,并采用众所周知的核方法学习预测测试数据。一个功能集抽象了文本对之间的通用统计信息,而另一个功能集直接基于传统的词袋模型对词汇功能进行建模。所提出的特征模型的功能不仅从其数据集中带来了额外的统计信息,而且还有助于增强预测能力。为了验证这一点,我们对新型基准语料库NTCIR-RITE-2011进行了实验。实验结果表明,与其他竞争对手相比,我们的方法取得了最佳结果。在准确性方面,我们的方法对NTC1R RITE MC任务达到了54.77%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号