Journal: 電子情報通信学会技術研究報告. 情報論的学習理論と機械学習 (IEICE Technical Report)

New Feature Selection Method for Reinforcement Learning: Conditional Mutual Information Reveals Implicit State-Reward Dependency

Abstract

Model-free reinforcement learning (RL) is a machine learning approach to decision making in unknown environments. However, real-world RL tasks often involve high-dimensional state spaces, in which standard RL methods perform poorly. In this paper, we propose a new feature selection framework for coping with such high dimensionality. The proposed framework adopts the conditional mutual information between state and return sequences as a feature selection criterion, allowing the evaluation of implicit state-reward dependency. The conditional mutual information is approximated by a least-squares method, which yields a computationally efficient feature selection procedure. The usefulness of the proposed method is demonstrated in simulated mobile-robot navigation experiments.
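
For reference, the criterion described in the abstract can be written with the standard definition of conditional mutual information. For a candidate state feature s, the return R, and the remaining (conditioning) features z,

\[ I(s; R \mid z) = \mathbb{E}_{p(s,R,z)}\!\left[ \log \frac{p(s, R \mid z)}{p(s \mid z)\, p(R \mid z)} \right], \]

which is zero exactly when s and R are conditionally independent given z; a large value indicates that s carries reward-relevant information not already captured by z. This is what allows the criterion to reveal implicit state-reward dependency.

Below is a minimal sketch of how such a score could drive greedy forward feature selection. It uses a naive histogram plug-in estimator on discretized data purely for illustration; the paper instead approximates the conditional mutual information by a least-squares method, which is not reproduced here, and the function names and toy setup are assumptions of this sketch, not the authors' implementation.

import numpy as np

def cmi_discrete(x, y, z):
    # Plug-in estimate of I(X; Y | Z) for integer-coded 1-D arrays.
    # A naive histogram estimator, used only for illustration; the
    # paper approximates the CMI with a least-squares method instead.
    total = 0.0
    for zv in np.unique(z):
        m = z == zv
        pz = m.mean()
        xs, ys = x[m], y[m]
        for xv in np.unique(xs):
            for yv in np.unique(ys):
                pxy = np.mean((xs == xv) & (ys == yv))
                if pxy > 0:
                    px = np.mean(xs == xv)
                    py = np.mean(ys == yv)
                    total += pz * pxy * np.log(pxy / (px * py))
    return total

def encode_rows(a):
    # Collapse each row of a 2-D integer array into a single code, so
    # several selected features act as one conditioning variable.
    if a.shape[1] == 0:
        return np.zeros(len(a), dtype=int)
    return np.unique(a, axis=0, return_inverse=True)[1]

def forward_select(S, R, k):
    # Greedily add the feature j with the largest estimated
    # I(s_j; R | features selected so far).
    selected = []
    for _ in range(k):
        z = encode_rows(S[:, selected])
        rest = [j for j in range(S.shape[1]) if j not in selected]
        scores = {j: cmi_discrete(S[:, j], R, z) for j in rest}
        selected.append(max(scores, key=scores.get))
    return selected

# Toy check: the return depends only on features 1 and 3.
rng = np.random.default_rng(0)
S = rng.integers(0, 3, size=(1000, 5))
R = 2 * S[:, 1] + S[:, 3]          # a real RL return would be discretized first
print(forward_select(S, R, k=2))   # expect features 1 and 3 to be chosen

In this toy run, the second pick relies on conditioning: once feature 1 is selected, feature 3 scores highly precisely because of its dependency with the return given the already-selected feature.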
