【24h】

Let's Ask Again: Refine Network for Automatic Question Generation

机译:再问一次:完善网络以自动生成问题

获取原文

摘要

In this work, we focus on the task of Automatic Question Generation (AQG) where given a passage and an answer the task is to generate the corresponding question. It is desired that the generated question should be (ⅰ) grammatically correct (ⅱ) answerable from the passage and (ⅲ) specific to the given answer. An analysis of existing AQG models shows that they produce questions which do not adhere to one or more of the above-mentioned qualities. In particular, the generated questions look like an incomplete draft of the desired question with a clear scope for refinement. To alleviate this shortcoming, we propose a method which tries to mimic the human process of generating questions by first creating an initial draft and then refining it. More specifically, we propose Refine Network (RefNet) which contains two decoders. The second decoder uses a dual attention network which pays attention to both (ⅰ) the original passage and (ⅱ) the question (initial draft) generated by the first decoder. In effect, it refines the question generated by the first decoder, thereby making it more correct and complete. We evaluate RefNet on three datasets, viz.. SQuAD, HOTPOT-QA, and DROP, and show that it outperforms existing state-of-the-art methods by 7-16% on all of these datasets. Lastly, we show that we can improve the quality of the second decoder on specific metrics, such as, fluency and answerability by explicitly rewarding revisions that improve on the corresponding metric during training. The code has been made publicly available~1.
机译:在这项工作中,我们专注于自动问题生成(AQG)的任务,其中给定段落和答案,任务是生成相应的问题。希望所生成的问题应是(ⅰ)语法正确(ⅱ)可以从段落中回答,并且(ⅲ)特定于给定的答案。对现有AQG模型的分析表明,它们产生的问题不符合一种或多种上述质量。特别是,生成的问题看起来像是所需问题的不完整草稿,具有明确的改进范围。为了缓解这一缺点,我们提出了一种方法,该方法试图模仿人类产生问题的过程,方法是先创建一个初始草稿,然后对其进行完善。更具体地说,我们提出了包含两个解码器的优化网络(RefNet)。第二个解码器使用双重注意网络,该网络同时注意(ⅰ)原始段落和(ⅱ)由第一个解码器生成的问题(初始草稿)。实际上,它细化了第一解码器生成的问题,从而使其更加正确和完整。我们在三个数据集上评估RefNet,即SQuAD,HOTPOT-QA和DROP,并显示在所有这些数据集上,RefNet的性能比现有的最新方法高7-16%。最后,我们表明,通过显式地奖励在培训过程中对相应指标进行改进的修订,我们可以在特定指标(例如流畅性和可回答性)上提高第二个解码器的质量。该代码已公开发布〜1。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号