首页> 外文会议>Annual meeting of the Special Interest Group on Discourse and Dialogue >Knowledge transfer between speakers for personalised dialogue management
【24h】

Knowledge transfer between speakers for personalised dialogue management

机译:用于个性化对话管理的发言人之间的知识转移

获取原文

摘要

Model-free reinforcement learning has been shown to be a promising data driven approach for automatic dialogue policy optimization, but a relatively large amount of dialogue interactions is needed before the system reaches reasonable performance. Recently, Gaussian process based reinforcement learning methods have been shown to reduce the number of dialogues needed to reach optimal performance, and pre-training the policy with data gathered from different dialogue systems has further reduced this amount. Following this idea, a dialogue system designed for a single speaker can be initialised with data from other speakers, but if the dynamics of the speakers are very different the model will have a poor performance. When data gathered from different speakers is available, selecting the data from the most similar ones might improve the performance. We propose a method which automatically selects the data to transfer by defining a similarity measure between speakers, and uses this measure to weight the influence of the data from each speaker in the policy model. The methods are tested by simulating users with different severities of dysarthria interacting with a voice enabled environmental control system.
机译:无模型强化学习已经被证明是自动对话政策优化看好数据驱动的方法,但该系统达到合理的性能之前,需要较大量的对话互动。最近,基于高斯过程的加固学习方法已被证明减少了达到最佳性能所需的对话次数,并预先培训与不同对话系统收集的数据的政策进一步降低了这一金额。在此想法之后,可以使用其他扬声器的数据初始化为单个扬声器设计的对话系统,但如果扬声器的动态非常不同,模型的性能差。当从不同扬声器收集的数据可用时,从最相似的数据中选择数据可能会提高性能。我们提出了一种方法,它通过在扬声器之间定义相似度测量来自动选择要传输的数据,并使用该度量来重量策略模型中的每个扬声器的数据的影响。通过模拟具有不同传感器的用户与能够的语音的环境控制系统进行不同严重性来测试这些方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号