Simultaneous feature selection and parameter optimization for training of dialog policy by reinforcement learning

Abstract

This paper addresses the problem of feature selection in the reinforcement learning (RL) of dialog policies for spoken dialog systems. A statistical dialog manager selects the actions the system should take based on features derived from the current dialog state and/or the system's belief state. When defining the features used to train the dialog policy, however, it is not obvious how to find a set of actually effective features among the potentially useful ones. In addition, the selection should be done simultaneously with the optimization of the dialog policy. In this paper, we propose an incremental feature selection method for the optimization of a dialog policy by RL, in which improvement of the dialog policy and feature selection are conducted simultaneously. Experiments in dialog policy optimization by RL with a user simulator demonstrated that 1) the proposed method can find a better dialog policy with fewer policy iterations and 2) its learning speed is comparable to the case where feature selection is conducted in advance.
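
The idea of interleaving feature selection with policy improvement can be illustrated with a minimal Python sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes a hypothetical one-step toy task with four binary candidate features (only feature 0 affects the reward), a linear action-value model, and a gradient-magnitude heuristic for choosing which feature to add. All names, the reward function, and the threshold are assumptions made for illustration.

```python
import random

N_FEATURES = 4    # hypothetical candidate features; only feature 0 matters here
ACTIONS = (0, 1)  # hypothetical actions: 0 = "confirm", 1 = "proceed"


def reward(x, a):
    """Toy reward (assumption): proceeding pays off only when feature 0 is on."""
    if a == 0:
        return 0.0
    return 1.0 if x[0] == 1 else -1.0


def q_value(w, selected, x, a):
    """Linear action value: bias plus weights of the currently selected features."""
    return w[a][0] + sum(w[a][j + 1] * x[j] for j in selected)


def greedy_action(w, selected, x):
    """Action with the highest estimated value under the current policy."""
    return max(ACTIONS, key=lambda b: q_value(w, selected, x, b))


def score(data, w, selected, j):
    """Largest per-action |mean(residual * x_j)|, used as a selection heuristic."""
    best = 0.0
    for b in ACTIONS:
        rows = [(x, r) for x, a, r in data if a == b]
        if rows:
            g = sum((r - q_value(w, selected, x, b)) * x[j] for x, r in rows)
            best = max(best, abs(g / len(rows)))
    return best


def train(iterations=30, batch=400, lr=0.2, eps=0.5, threshold=0.25, seed=0):
    rng = random.Random(seed)
    w = {a: [0.0] * (N_FEATURES + 1) for a in ACTIONS}  # index 0 is the bias
    selected = []                                       # start with no features
    for _ in range(iterations):
        # 1) Collect a batch of interactions with an epsilon-greedy policy.
        data = []
        for _ in range(batch):
            x = [rng.randint(0, 1) for _ in range(N_FEATURES)]
            if rng.random() < eps:
                a = rng.choice(ACTIONS)
            else:
                a = greedy_action(w, selected, x)
            data.append((x, a, reward(x, a)))
        # 2) Feature selection: add at most one feature per iteration,
        #    and only if its score clears the threshold.
        candidates = [j for j in range(N_FEATURES) if j not in selected]
        if candidates:
            best = max(candidates, key=lambda j: score(data, w, selected, j))
            if score(data, w, selected, best) > threshold:
                selected.append(best)
        # 3) Policy improvement: one stochastic-gradient pass over the batch.
        for x, a, r in data:
            err = r - q_value(w, selected, x, a)
            w[a][0] += lr * err
            for j in selected:
                w[a][j + 1] += lr * err * x[j]
    return selected, w
```

Calling `train()` returns the list of selected feature indices and the learned weights, and `greedy_action(w, selected, x)` then gives the policy's choice for a state `x`. Adding at most one feature per iteration keeps the model small while the policy is still changing, which is the general motivation for doing both steps in the same loop.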

Bibliographic details

Source: IEEE Workshop on Spoken Language Technology, 2012, 6 pages
Authors: Misu Teruhisa; Kashioka Hideki
Original format: PDF
Chinese Library Classification: TN912-53
Keywords: Dialog management; Feature selection; Reinforcement learning; Spoken dialog systems
