Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)

Learning When Not to Answer: A Ternary Reward Structure for Reinforcement Learning based Question Answering

Abstract

In this paper, we investigate the challenges of using reinforcement learning agents for question answering over knowledge graphs in real-world applications. We examine the performance metrics used by state-of-the-art systems and find that they are inadequate for such settings. More specifically, they do not correctly evaluate systems in situations where no answer is available, so agents optimized for these metrics are poor at modeling confidence. We introduce a simple new performance metric for evaluating question-answering agents that is more representative of practical usage conditions, and we optimize for this metric by extending the binary reward structure used in prior work to a ternary reward structure, which also rewards an agent for not answering a question rather than giving an incorrect answer. We show that this drastically improves the precision of answered questions while declining to answer only a limited number of questions that were previously answered correctly. Bootstrapping the reinforcement learning algorithm with a supervised learning strategy based on depth-first-search paths further improves performance.
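The abstract describes the ternary reward only at a high level. Below is a minimal Python sketch of how such a terminal reward might look; the NO_ANSWER sentinel action and the specific reward values (R_CORRECT, R_ABSTAIN, R_WRONG) are illustrative assumptions, not values taken from the paper.

```python
# Minimal sketch of a ternary terminal reward for a KG question-answering
# agent. Reward values and the NO_ANSWER sentinel are assumptions made
# for illustration only.

NO_ANSWER = "<no_answer>"  # hypothetical action letting the agent abstain

R_CORRECT = 1.0    # agent reaches a correct answer entity
R_ABSTAIN = 0.1    # agent explicitly declines to answer (assumed value)
R_WRONG = -1.0     # agent commits to an incorrect answer (assumed value)


def ternary_reward(prediction: str, gold_answers: set) -> float:
    """Terminal reward ordered as: correct > abstain > wrong.

    A binary scheme scores abstaining and answering incorrectly the
    same; separating them lets the agent learn when not to answer.
    """
    if prediction == NO_ANSWER:
        return R_ABSTAIN
    return R_CORRECT if prediction in gold_answers else R_WRONG


# Example: abstaining on an unanswerable question is rewarded more than
# guessing wrong, so the policy gradient favors calibrated abstention.
assert ternary_reward(NO_ANSWER, set()) > ternary_reward("Paris", {"Berlin"})
```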