首页> 外文会议>IEEE International Conference on Semantic Computing >Evaluating Architectural Choices for Deep Learning Approaches for Question Answering Over Knowledge Bases
【24h】

Evaluating Architectural Choices for Deep Learning Approaches for Question Answering Over Knowledge Bases

机译:评估用于知识库问题的深度学习方法的架构选择

获取原文

摘要

The task of answering natural language questions over knowledge bases has received wide attention in recent years. Various deep learning architectures have been proposed for this task. However, architectural design choices are typically not systematically compared nor evaluated under the same conditions. In this paper, we contribute to a better understanding of the impact of architectural design choices by evaluating four different architectures under the same conditions. We address the task of answering simple questions, consisting in predicting the subject and predicate of a triple given a question. In order to provide a fair comparison of different architectures, we evaluate them under the same strategy for inferring the subject, and compare different architectures for inferring the predicate. The architecture for inferring the subject is based on a standard LSTM model trained to recognize the span of the subject in the question and on a linking component that links the subject span to an entity in the knowledge base. The architectures for predicate inference are based on i) a standard softmax classifier ranging over all predicates as output, ii) a model that predicts a low-dimensional encoding of the property and subject entity, iii) a model that learns to score a pair of subject and predicate given the question as well as iv) a model based on the well-known FastText model. The comparison of architectures shows that FastText provides better results than other architectures.
机译:近年来,通过知识库回答自然语言问题的任务已引起广泛关注。已经针对此任务提出了各种深度学习架构。但是,通常不会在相同条件下对体系结构设计选择进行系统地比较或评估。在本文中,我们通过在相同条件下评估四种不同的体系结构,有助于更好地理解体系结构设计选择的影响。我们解决回答简单问题的任务,包括预测给定问题的三元组的主题和谓词。为了公平地比较不同的体系结构,我们在相同的策略下对它们进行评估以推断出主题,并比较不同的体系结构以推断谓词。用于推断主题的体系结构基于经过训练以识别问题中主题范围的标准LSTM模型以及将主题范围链接到知识库中的实体的链接组件。谓词推理的体系结构基于:i)一个标准的softmax分类器,范围覆盖所有谓词,作为输出; ii)预测属性和主题实体的低维编码的模型; iii)学会对一对得分进行评分的模型。主题和谓词给出了问题,以及iv)基于众所周知的FastText模型的模型。架构的比较表明,FastText提供了比其他架构更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号