Using reinforcement learning with external rewards for open-domain natural language generation

Srinivasan Vidhushini; Santhanam Sashank; Shaikh Samira

首页> 外文期刊>Journal of Intelligent Information Systems >Using reinforcement learning with external rewards for open-domain natural language generation

【24h】

Using reinforcement learning with external rewards for open-domain natural language generation

机译：使用强化学习与外部奖励进行开放式自然语言生成

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a new approach towards emotional natural language generation using bidirectional seq2seq model. Our goal is to generate emotionally relevant language that accommodates the emotional tone of the prior context. To incorporate emotional information, we train our own embeddings appended with emotion values through valence, arousal and dominance scores. We use a reinforcement-learning framework, which is tuned using policy gradient method. Two of the internal rewards in our reinforcement learning framework, viz. Ease of Answering and Semantic Coherence are based on prior state-of-the-art. We propose a new internal reward, Emotional Intelligence, computed by minimizing the affective dissonance between the source and generated text. We also train a separate external reward analyzer to predict the rewards as well as to maximize the expected rewards (both internal and external). We evaluate the system on two common corpora used for Natural Language Generation tasks: the Cornell Movie Dialog and Yelp Restaurant Review Corpus. We report standard evaluation metrics including BLEU, ROUGE-L and perplexity as well as human evaluation to validate our approach. We demonstrate the ability of proposed model to generate emotionally appropriate responses on both corpora.

机译：我们向使用双向SEQ2SEQ模型提出了一种新的情绪自然语言生成方法。我们的目标是产生情绪相关的语言，以满足现有背景的情感语调。为了合并情感信息，我们培养我们自己的嵌入，通过价值，唤醒和统治分数附加情感值。我们使用强化学习框架，使用策略梯度方法进行调整。我们加强学习框架中的两个内部奖励，viz。易于应答和语义连贯性基于现有最先进的。我们提出了一种新的内部奖励，情商，通过最大限度地减少来源和生成文本之间的情感解散来计算。我们还培训一个单独的外部奖励分析仪来预测奖励，并最大限度地提高预期的奖励（内部和外部）。我们评估了用于自然语言生成任务的两个共同的语料库：康奈尔电影对话框和yelp餐厅评论语料库。我们报告标准评估指标，包括Bleu，Rouge-L和困惑以及人类评估，以验证我们的方法。我们展示了拟议模型在同类公司产生情绪适当反应的能力。

著录项

来源
《Journal of Intelligent Information Systems》 |2021年第1期|189-206|共18页
作者
Srinivasan Vidhushini; Santhanam Sashank; Shaikh Samira;
展开▼
作者单位

Univ N Carolina Dept Comp Sci Charlotte NC 28223 USA;

Univ N Carolina Dept Comp Sci Charlotte NC 28223 USA;

Univ N Carolina Dept Comp Sci Charlotte NC 28223 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Deep learning; Reinforcement learning; Emotional intelligence; Human feedback; Seq2seq learning; Conversational agent; Natural language generation;

机译：深入学习;加强学习;情商;人体反馈;SEQ2SEQ学习;会话代理;自然语言生成;

相似文献

外文文献
中文文献
专利

1. Hierarchical reinforcement learning for situated natural language generation [J] . NINA DETHLEFS, HERIBERTO CUAYAHUITL Natural language engineering . 2015,第may期

机译：分层强化学习以生成自然语言
2. Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation [J] . Seiya Kuroda, Kazuteru Miyazaki, Hiroaki Kobayashi Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2012,第6a94期

机译：将固定模式状态引入带有惩罚和奖励的在线强化学习中，并将其应用于两足机器人腰部弹道
3. On the Generation of E-Learning Resources Using Business Process, Natural Language Processing, and Web Services [J] . Graciela Fragoso-Diaz Olivia, Lopez-Caballero Vitervo, Carlos Rojas-Perez Juan, IT professional . 2021,第2期

机译：在使用业务流程，自然语言处理和Web服务的生成电子学习资源
4. Implementation of Language-Action Reward Network in Reinforcement Learning by Using Natural Language [C] . Sagiraju Hima Keerthi IEEE India Council International Subsections Conference . 2020

机译：使用自然语言实现强化学习中的语言行动奖励网络
5. Deep Reinforcement Learning in Natural Language Scenarios [D] . He, Ji. 2017

机译：自然语言场景中的深度强化学习
6. Inferring reward prediction errors in patients with schizophrenia: a dynamic reward task for reinforcement learning [O] . Chia-Tzu Li, Wen-Sung Lai, Chih-Min Liu, 2014

机译：推断精神分裂症患者的奖励预测错误：强化学习的动态奖励任务
7. Logical Natural Language Generation from Open-Domain Tables [O] . Wenhu Chen, Jianshu Chen, Yu Su, 2020

机译：开放式域表的逻辑自然语言生成

Using reinforcement learning with external rewards for open-domain natural language generation

摘要

著录项

相似文献

相关主题

期刊订阅