
Fast Reinforcement Learning of Dialogue Policies using Stable Function Approximation



Abstract

We propose a method to speed up reinforcement learning of policies for spoken dialogue systems. This is achieved by learning the value of applying actions in selected states only. The value of unsampled states is approximated by a linear interpolation of known states. Experiments show that the improved algorithm speeds up the learning of dialogue policies.
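
The interpolation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the state representation (numeric feature tuples), the inverse-distance weighting over the k nearest sampled states, and the names `interpolate_value` and `known_values` are all assumptions made for illustration. For a query state lying on the segment between its two nearest sampled states, this weighting reduces to ordinary linear interpolation.

```python
import math


def interpolate_value(state, known_values, k=2):
    """Estimate the value of `state` from the values of sampled states.

    Assumption: states are numeric feature tuples, and the estimate is a
    convex combination of the k nearest sampled states with
    inverse-distance weights. The source does not specify these details.
    """
    # A sampled state keeps its learned value unchanged.
    if state in known_values:
        return known_values[state]

    # Pick the k sampled states closest to the query (Euclidean distance).
    neighbours = sorted(known_values, key=lambda s: math.dist(s, state))[:k]

    # Inverse-distance weights, normalised so that they sum to one.
    weights = [1.0 / math.dist(s, state) for s in neighbours]
    total = sum(weights)
    return sum(w * known_values[s] for w, s in zip(weights, neighbours)) / total


# Hypothetical sampled states with already learned values.
known_values = {(0.0, 0.0): 1.0, (1.0, 0.0): 3.0, (0.0, 1.0): 5.0}

# The midpoint between (0, 0) and (1, 0) receives the mean of their values.
print(interpolate_value((0.5, 0.0), known_values))  # prints 2.0
```

Because the weights are non-negative and sum to one, the estimate never leaves the range of the sampled values, which is the kind of property that keeps value updates from diverging under function approximation.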


Bibliographic details

Source
International Joint Conference on Natural Language Processing | 2004 | 6 pages
Authors
Matthias Denecke; Kohji Dohsaka; Mikio Nakano
Conference organizers
Association for Computational Linguistics (ACL); Association for Computational Linguistics and Chinese Language Processing (ACLCLP); Association of Natural Language Processing (ANLP)
Original format: PDF
Chinese Library Classification: Programming languages, algorithmic languages

