International Joint Conference on Artificial Intelligence

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer

Abstract

Unsupervised text style transfer aims to transfer the underlying style of text while keeping its main content unchanged, without parallel data. Most existing methods follow two steps: first separating the content from the original style, and then fusing the content with the desired style. However, the separation in the first step is challenging because content and style interact in subtle ways in natural language. In this paper, we therefore propose a dual reinforcement learning framework that directly transfers the style of the text via a one-step mapping model, without any separation of content and style. Specifically, we treat the learning of the source-to-target and target-to-source mappings as a dual task, and we design two rewards based on this dual structure to reflect style accuracy and content preservation, respectively. In this way, the two one-step mapping models can be trained via reinforcement learning without any parallel data. Automatic evaluations show that our model outperforms state-of-the-art systems by a large margin, with an average improvement of more than 8 BLEU points on two benchmark datasets. Human evaluations also validate the effectiveness of our model in terms of style accuracy, content preservation, and fluency. Our code and data, including the outputs of all baselines and our model, are available at https://github.com/luofuli/DualRL.
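The abstract describes two rewards, one for style accuracy and one for content preservation, that jointly drive policy-gradient training of the one-step mapping models. The sketch below illustrates one plausible shape of that reward computation in Python; it is not the authors' released code. The harmonic-mean combination, the function names, and all numeric inputs are assumptions made for illustration.

```python
# Minimal sketch (not the authors' code) of a dual reward for style transfer.
# Assumed setup: a pre-trained style classifier supplies r_style (confidence
# that the transferred sentence carries the target style), and the dual
# target-to-source model supplies r_content (how well the original sentence
# can be reconstructed from the transfer).

def dual_reward(r_style: float, r_content: float,
                beta: float = 1.0, eps: float = 1e-8) -> float:
    """Harmonic-mean combination (an assumed form): the reward is high only
    when BOTH style accuracy and content preservation are high, so the
    policy cannot trade one objective away for the other."""
    return (1 + beta ** 2) * r_style * r_content / (
        beta ** 2 * r_style + r_content + eps)


def reinforce_loss(log_prob: float, reward: float,
                   baseline: float = 0.0) -> float:
    """REINFORCE surrogate loss: minimizing it raises the probability of
    sampled transfers whose reward exceeds the baseline."""
    return -(reward - baseline) * log_prob


# Toy usage with made-up numbers standing in for real model outputs.
r_style = 0.92    # hypothetical classifier confidence in the target style
r_content = 0.75  # hypothetical back-transfer reconstruction probability
reward = dual_reward(r_style, r_content)
loss = reinforce_loss(log_prob=-3.4, reward=reward, baseline=0.5)
print(f"reward={reward:.3f}  surrogate_loss={loss:.3f}")
```

Because both mapping directions are trained this way against each other, neither model ever needs parallel sentence pairs: each direction's output is scored by the classifier and by its dual counterpart.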
