Hierarchical reinforcement learning for situated natural language generation

NINA DETHLEFS; HERIBERTO CUAYAHUITL

Abstract

Natural Language Generation systems in interactive settings often face a multitude of choices, given that the communicative effect of each utterance they generate depends crucially on the interplay between its physical circumstances, addressee and interaction history. This is particularly true in interactive and situated settings. In this paper we present a novel approach for situated Natural Language Generation in dialogue that is based on hierarchical reinforcement learning and learns the best utterance for a context by optimisation through trial and error. The model is trained from human-human corpus data and learns in particular to balance the trade-off between efficiency and detail in giving instructions: the user needs to be given sufficient information to execute their task, but without exceeding their cognitive load. We present results from simulation and a task-based human evaluation study comparing two different versions of hierarchical reinforcement learning: one operates using a hierarchy of policies with a large state space and local knowledge, and the other additionally shares knowledge across generation subtasks to enhance performance. Results show that sharing knowledge across subtasks achieves better performance than learning in isolation, leading to smoother and more successful interactions that are better perceived by human users.

Bibliographic details

Source: Natural Language Engineering, 2015, issue 5, pp. 391-435 (45 pages)
Authors: NINA DETHLEFS; HERIBERTO CUAYAHUITL
Affiliation: Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK
Original format: PDF
Language: English
