首页> 外文会议>Conference on Empirical Methods in Natural Language Processing >PRover: Proof Generation for Interpretable Reasoning over Rules
【24h】

PRover: Proof Generation for Interpretable Reasoning over Rules

机译:箴言:证明是解释规则的可解释推理

获取原文

摘要

Recent work by Clark et al. (2020) shows that transformers can act as "soft theorem provers" by answering questions over explicitly provided knowledge in natural language. In our work, we take a step closer to emulating formal theorem provers, by proposing PROVER, an interpretable transformer-based model that jointly answers binary questions over rule-bases and generates the corresponding proofs. Our model learns to predict nodes and edges corresponding to proof graphs in an efficient constrained training paradigm. During inference, a valid proof, satisfying a set of global constraints is generated. We conduct experiments on synthetic, hand-authored, and human-paraphrased rule-bases to show promising results for QA and proof generation, with strong generalization performance. First, PROVER generates proofs with an accuracy of 87%, while retaining or improving performance on the QA task, compared to RuleTak-ers (up to 6% improvement on zero-shot evaluation). Second, when trained on questions requiring lower depths of reasoning, it generalizes significantly better to higher depths (up to 15% improvement). Third, PROVER obtains near perfect QA accuracy of 98% using only 40% of the training data. However, generating proofs for questions requiring higher depths of reasoning becomes challenging, and the accuracy drops to 65% for "depth 5", indicating significant scope for future work.
机译:Clark等人最近的工作。 (2020)表明,通过在自然语言中明确提供的知识的问题回答问题,变形金刚可以充当“软定理普罗瓦尔”。在我们的工作中,我们通过提出先报,迈出更接近正式的定理普通的普通定理普通的基于译文的基于译文的模型,该模型共同回答规则基础,并产生相应的证明。我们的模型学会在有效的受限训练范式中预测与证明图对相对应的节点和边缘。在推理期间,生成满足一组全局约束的有效证明。我们对综合,手工撰写和人类剖标的规则基础进行实验,以表明QA和证据的有希望的结果,具有强大的泛化性能。首先,与Ruletak-ers(零拍评估的提高高达6%)相比,先驱的证明书产生了87%的证据,而持续或提高了QA任务的性能。其次,当训练有于需要较低的推理深度的问题时,它会显着更好地拓展到更高的深度(高达15%的改进)。第三,通过仅使用40%的培训数据获得98%的完美QA精度接近完美的QA准确性。然而,为需要更高的推理深度的问题产生证据变得具有挑战性,并且“深度5”的精度下降至65%,表明未来工作的重要范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号