首页> 外文会议>International Conference on Autonomous Agents and Multiagent Systems >Transfer Learning for User Adaptation in Spoken Dialogue Systems
【24h】

Transfer Learning for User Adaptation in Spoken Dialogue Systems

机译:在口语对话系统中转移学习以获取用户适应

获取原文

摘要

This paper focuses on user adaptation in Spoken Dialogue Systems. It is considered that the system has already been optimised with Reinforcement Learning methods for a set of users. The goal is to use and transfer this prior knowledge to adapt the system to a new user as quickly as possible without impacting asymptotic performance. The first contribution is a source selection method using a multi-armed stochastic bandit algorithm in order to improve the jumpstart, i.e. the average performance at the start of the learning curve. Contrarily to previous source selection methods, there is no need to define a metric between users, and it is parameter free. The second contribution is an innovative method for selecting the most informative transitions within the previously selected source, to improve the target model, in such a way that only transitions that were not observed with the target user are transferred from the selected source. For our experimentation, Reinforcement Learning is performed with the Fitted Q-Iteration algorithm. Both methods are validated on a negotiation game: an appointment scheduling simulator that allows the definition of simulated user models adopting diversified behaviours. Compared to state-of-the-art transfer algorithms, results show significant improvements for both jumpstart and asymptotic performance.
机译:本文重点介绍了用户对话系统的适应。据认为,该系统已经通过了一组用户的增强学习方法进行了优化。目标是使用并转移该事先知识,以便在不影响渐近性能的情况下尽快将系统适应新用户。第一种贡献是使用多武装随机强盗算法的源选择方法,以改善JumpStart,即学习曲线开始时的平均性能。与先前的源选择方法相反,不需要在用户之间定义度量,并且它是免费的。第二贡献是用于选择先前所选择的源中最具信息的转换的创新方法,以改进目标模型,以这种方式,即仅从所选择的源传送未观察到未观察到的过渡。对于我们的实验,用拟合Q迭代算法进行强化学习。两种方法都在协商游戏上验证:预约调度模拟器,允许定义模拟用户模型采用多样化行为。与最先进的转移算法相比,结果显示了JumpStart和渐近性能的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号