PRover: Proof Generation for Interpretable Reasoning over Rules

机译：箴言：证明是解释规则的可解释推理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent work by Clark et al. (2020) shows that transformers can act as "soft theorem provers" by answering questions over explicitly provided knowledge in natural language. In our work, we take a step closer to emulating formal theorem provers, by proposing PROVER, an interpretable transformer-based model that jointly answers binary questions over rule-bases and generates the corresponding proofs. Our model learns to predict nodes and edges corresponding to proof graphs in an efficient constrained training paradigm. During inference, a valid proof, satisfying a set of global constraints is generated. We conduct experiments on synthetic, hand-authored, and human-paraphrased rule-bases to show promising results for QA and proof generation, with strong generalization performance. First, PROVER generates proofs with an accuracy of 87%, while retaining or improving performance on the QA task, compared to RuleTak-ers (up to 6% improvement on zero-shot evaluation). Second, when trained on questions requiring lower depths of reasoning, it generalizes significantly better to higher depths (up to 15% improvement). Third, PROVER obtains near perfect QA accuracy of 98% using only 40% of the training data. However, generating proofs for questions requiring higher depths of reasoning becomes challenging, and the accuracy drops to 65% for "depth 5", indicating significant scope for future work.

机译：Clark等人最近的工作。（2020）表明，通过在自然语言中明确提供的知识的问题回答问题，变形金刚可以充当“软定理普罗瓦尔”。在我们的工作中，我们通过提出先报，迈出更接近正式的定理普通的普通定理普通的基于译文的基于译文的模型，该模型共同回答规则基础，并产生相应的证明。我们的模型学会在有效的受限训练范式中预测与证明图对相对应的节点和边缘。在推理期间，生成满足一组全局约束的有效证明。我们对综合，手工撰写和人类剖标的规则基础进行实验，以表明QA和证据的有希望的结果，具有强大的泛化性能。首先，与Ruletak-ers（零拍评估的提高高达6％）相比，先驱的证明书产生了87％的证据，而持续或提高了QA任务的性能。其次，当训练有于需要较低的推理深度的问题时，它会显着更好地拓展到更高的深度（高达15％的改进）。第三，通过仅使用40％的培训数据获得98％的完美QA精度接近完美的QA准确性。然而，为需要更高的推理深度的问题产生证据变得具有挑战性，并且“深度5”的精度下降至65％，表明未来工作的重要范围。

著录项

来源
《Conference on Empirical Methods in Natural Language Processing》|2020年|122-136|共15页
会议地点
作者
Swarnadeep Saha; Sayan Ghosh; Shashank Srivastava; Mohit Bansal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Verifying B proof rules using deep embedding and automated theorem proving [J] . Melanie Jacquel, Karim Berkani, David Delahaye, Software and systems modeling . 2015,第1期

机译：使用深度嵌入和自动定理证明来验证B证明规则
2. Knowledge-aware path:interpretable graph reasoning in proactive dialogue generation [J] . Sun Yinan, Xu Yajing, Li Si, 中国邮电高校学报（英文版） . 2021,第001期

机译：知识意识的道路：主动对话一代中可解释的图形推理
3. A modularized contact-rule reasoning approach to the assembly sequence generation for product design [J] . Ming-Chyuan Lin, Chen-Cheng Lin, Yuan-Yeuan Tai, Concurrent engineering . 2012,第3期

机译：用于产品设计的组装顺序生成的模块化接触规则推理方法
4. multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning [C] . Swarnadeep Saha, Prateek Yadav, Mohit Bansal Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2021

机译：MultiPRover：生成多种证明，以提高规则推理中的解释性
5. Parallelizing an Interactive Theorem Prover: Functional Programming and Proofs with ACL2. [D] . Rager, David Lawrence. 2012

机译：并行化交互式定理证明：使用ACL2进行函数式编程和证明。
6. Mechanisms of Rule Acquisition and Rule Following in Inductive Reasoning [O] . Cristiano Crescentini, Shima Seyed-Allaei, Nicola De Pisapia, 2011

机译：归纳推理中的规则获取和规则遵循机制
7. multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning [O] . Swarnadeep Saha, Prateek Yadav, Mohit Bansal 2021

机译：MultiProver：为改善规则推理的解释性产生多种证据
8. Interactive Compiler Proving Using Hoare Proof Rules. [R] . Lynn, D. S. 1978

机译：交互式编译器使用Hoare证明规则进行证明。

PRover: Proof Generation for Interpretable Reasoning over Rules

摘要

著录项

相似文献

相关主题

期刊订阅