Conference on Empirical Methods in Natural Language Processing

Automatic Poetry Generation with Mutual Reinforcement Learning



Abstract

Poetry is one of the most beautiful forms of human language art. As a crucial step towards computer creativity, automatic poetry generation has drawn researchers' attention for decades. In recent years, neural models have made remarkable progress on this task. However, they are all based on maximum likelihood estimation, which only learns the common patterns of the corpus and results in a loss-evaluation mismatch: human experts evaluate poetry by specific criteria, not by word-level likelihood. To handle this problem, we directly model those criteria and use them as explicit rewards that guide gradient updates through reinforcement learning, motivating the model to pursue higher scores. In addition, inspired by writing theories, we propose a novel mutual reinforcement learning scheme: we simultaneously train two learners (generators) that learn not only from the teacher (the rewarder) but also from each other, further improving performance. We experiment on Chinese poetry. Built on a strong base model, our method achieves better results and outperforms the current state-of-the-art method.
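
The abstract compresses two technical ideas: (1) a policy-gradient (REINFORCE-style) update, in which a learned rewarder scores sampled poems and the score, minus a baseline, weights the gradient of each sample's log-likelihood; and (2) mutual learning, in which two generators are trained jointly and each is additionally rewarded when its peer finds its samples likely. Below is a minimal Python/PyTorch sketch of that setup. It is illustrative only, not the authors' implementation: the GRU Generator, the random-placeholder rewarder, peer_likelihood, and mutual_weight are all hypothetical names and choices.

# Illustrative sketch only -- not the paper's code. The GRU generator, the
# random-placeholder rewarder, peer_likelihood, and mutual_weight are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, EMB, HID, MAX_LEN = 5000, 128, 256, 20

class Generator(nn.Module):
    """Toy autoregressive poem generator (a GRU language model)."""
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB, EMB)
        self.rnn = nn.GRU(EMB, HID, batch_first=True)
        self.out = nn.Linear(HID, VOCAB)

    def sample(self, batch=4):
        """Sample token sequences and keep their log-probabilities."""
        tok = torch.zeros(batch, 1, dtype=torch.long)      # assume id 0 = <bos>
        h, toks, logps = None, [], []
        for _ in range(MAX_LEN):
            out, h = self.rnn(self.emb(tok), h)
            dist = torch.distributions.Categorical(logits=self.out(out[:, -1]))
            nxt = dist.sample()                            # (batch,)
            logps.append(dist.log_prob(nxt))
            tok = nxt.unsqueeze(1)
            toks.append(tok)
        return torch.cat(toks, 1), torch.stack(logps, 1)   # (batch, L) each

def rewarder(poems):
    """Stand-in for the learned criteria-based rewarder (the "teacher") from
    the abstract; a real one would score the poetry criteria. Random here."""
    return torch.rand(poems.size(0))

def peer_likelihood(peer, poems):
    """How plausible the peer generator finds our samples, mapped into (0, 1]."""
    out, _ = peer.rnn(peer.emb(poems[:, :-1]))
    logits = peer.out(out)
    nll = F.cross_entropy(logits.reshape(-1, VOCAB),
                          poems[:, 1:].reshape(-1), reduction='none')
    return (-nll.view(poems.size(0), -1).mean(1)).exp()

def rl_step(gen, peer, opt, mutual_weight=0.5):
    """One REINFORCE update: reward = teacher score + peer agreement."""
    poems, logps = gen.sample()
    with torch.no_grad():
        reward = rewarder(poems) + mutual_weight * peer_likelihood(peer, poems)
    advantage = reward - reward.mean()                     # baseline for variance reduction
    loss = -(advantage.unsqueeze(1) * logps).mean()        # policy-gradient loss
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

g1, g2 = Generator(), Generator()
opt1 = torch.optim.Adam(g1.parameters(), lr=1e-4)
opt2 = torch.optim.Adam(g2.parameters(), lr=1e-4)
for step in range(3):                                      # the two learners train jointly
    rl_step(g1, g2, opt1)
    rl_step(g2, g1, opt2)

Subtracting the batch-mean reward as a baseline is a standard variance-reduction device for REINFORCE; a real rewarder would implement the poetry-specific evaluation criteria the abstract refers to rather than returning random scores.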
