首页> 外文会议>Annual meeting of the Special Interest Group on Discourse and Dialogue >Knowledge transfer between speakers for personalised dialogue management
【24h】

Knowledge transfer between speakers for personalised dialogue management

机译:演讲者之间的知识转移,用于个性化对话管理

获取原文

摘要

Model-free reinforcement learning has been shown to be a promising data driven approach for automatic dialogue policy optimization, but a relatively large amount of dialogue interactions is needed before the system reaches reasonable performance. Recently, Gaussian process based reinforcement learning methods have been shown to reduce the number of dialogues needed to reach optimal performance, and pre-training the policy with data gathered from different dialogue systems has further reduced this amount. Following this idea, a dialogue system designed for a single speaker can be initialised with data from other speakers, but if the dynamics of the speakers are very different the model will have a poor performance. When data gathered from different speakers is available, selecting the data from the most similar ones might improve the performance. We propose a method which automatically selects the data to transfer by defining a similarity measure between speakers, and uses this measure to weight the influence of the data from each speaker in the policy model. The methods are tested by simulating users with different severities of dysarthria interacting with a voice enabled environmental control system.
机译:无模型强化学习已被证明是一种用于自动对话策略优化的有前途的数据驱动方法,但是在系统达到合理性能之前,需要相对大量的对话交互。最近,基于高斯过程的强化学习方法已显示减少达到最佳性能所需的对话数量,并且使用从不同对话系统收集的数据对策略进行预训练进一步减少了该数量。遵循这个想法,可以使用来自其他扬声器的数据来初始化为单个扬声器设计的对话系统,但是如果扬声器的动态变化很大,则该模型的性能将很差。当从不同说话者收集的数据可用时,从最相似的说话者中选择数据可能会提高性能。我们提出了一种方法,该方法通过定义说话者之间的相似性度量来自动选择要传输的数据,并使用该度量权重策略模型中来自每个说话者的数据的影响。通过模拟具有不同严重程度的构音障碍的用户与启用语音的环境控制系统进行交互,对这些方法进行了测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号