【24h】

Maximum Entropy Context Models for Ranking Biographical Answers to Open-Domain Definition Questions

机译:用于对开放域定义问题的传记答案进行排名的最大熵上下文模型

获取原文

摘要

In the context of question-answering systems, there are several strategies for scoring candidate answers to definition queries including centroid vectors, bi-term and context language models. These techniques use only positive examples (i.e., descriptions) when building their models. In this work, a maximum entropy based extension is proposed for context language models so as to account for regularities across non-descriptions mined from web-snippets. Experiments show that this extension outperforms other strategies increasing the precision of the top five ranked answers by more than 5%. Results suggest that web-snippets are a cost-efficient source of non-descriptions, and that some relationships extracted from dependency trees are effective to mine for candidate answer sentences.
机译:在问答系统的上下文中,有几种对评分查询的候选答案进行评分的策略,包括质心向量,双向和上下文语言模型。这些技术在建立模型时仅使用正面示例(即描述)。在这项工作中,为上下文语言模型提出了一个基于最大熵的扩展,以便解决从Web片段中挖掘出来的非描述中的规律性。实验表明,此扩展优于其他策略,从而使排名前5位的答案的准确性提高了5%以上。结果表明,网页摘要是一种经济高效的非描述来源,并且从依存关系树中提取的某些关系可以有效地挖掘候选答案句子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号