International Conference on Computational Linguistics

Exploring Question-Specific Rewards for Generating Deep Questions



Abstract

Recent question generation (QG) approaches often utilize the sequence-to-sequence framework (Seq2Seq) to optimize the log-likelihood of ground-truth questions using teacher forcing. However, this training objective is inconsistent with actual question quality, which is often reflected by certain global properties, such as whether the question can be answered by the document. As such, we directly optimize for QG-specific objectives via reinforcement learning to improve question quality. We design three different rewards that target the fluency, relevance, and answerability of generated questions. We conduct both automatic and human evaluations, in addition to a thorough analysis, to explore the effect of each QG-specific reward. We find that optimizing question-specific rewards generally leads to better performance on automatic evaluation metrics. However, only the rewards that correlate well with human judgement (e.g., relevance) lead to real improvements in question quality. Optimizing for the others, especially answerability, introduces incorrect bias into the model, resulting in poor question quality.
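
The abstract only sketches the training setup, so below is a minimal, hypothetical Python/PyTorch illustration of how fluency, relevance, and answerability rewards could be combined and optimized with a self-critical REINFORCE objective on top of a Seq2Seq QG model. All names and reward implementations here (reward_fluency, reward_relevance, reward_answerability, the dummy log-probabilities) are placeholders assumed for illustration, not the authors' code.

```python
import torch

# Hypothetical reward models; the paper's actual fluency, relevance, and
# answerability rewards are model-based and are not reproduced here.
def reward_fluency(question: str) -> float:
    # Placeholder: e.g., a language-model score for how natural the question reads.
    return min(1.0, len(question.split()) / 10.0)

def reward_relevance(question: str, document: str) -> float:
    # Placeholder: e.g., a discriminator's probability that the question matches the document.
    return 0.6

def reward_answerability(question: str, document: str, answer: str) -> float:
    # Placeholder: e.g., a QA model's confidence that the answer is recoverable from the document.
    return 0.5

def combined_reward(question: str, document: str, answer: str,
                    weights=(1.0, 1.0, 1.0)) -> float:
    w_f, w_r, w_a = weights
    return (w_f * reward_fluency(question)
            + w_r * reward_relevance(question, document)
            + w_a * reward_answerability(question, document, answer))

def self_critical_loss(sampled_logprobs: torch.Tensor,
                       sampled_reward: float,
                       greedy_reward: float) -> torch.Tensor:
    # REINFORCE with a self-critical baseline: increase the log-probability of the
    # sampled question only if it scores better than the greedy-decoded baseline.
    advantage = sampled_reward - greedy_reward
    return -advantage * sampled_logprobs.sum()

# Toy usage: in practice sampled_logprobs come from the Seq2Seq decoder for a
# question sampled from the model; here they are dummy values.
sampled_logprobs = torch.log(torch.tensor([0.4, 0.3, 0.5], requires_grad=True))
loss = self_critical_loss(
    sampled_logprobs,
    sampled_reward=combined_reward("what caused the 1906 earthquake in san francisco ?",
                                   "document text", "answer span"),
    greedy_reward=combined_reward("what happened ?", "document text", "answer span"),
)
loss.backward()  # gradients flow into the decoder through the log-probabilities
```

In such a setup, the reward weights control the trade-off the abstract discusses: up-weighting a reward that correlates poorly with human judgement (e.g., answerability) can raise automatic metrics while degrading actual question quality.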
