International Conference on Neural Information Processing

Extraction of Reward-Related Feature Space Using Correlation-Based and Reward-Based Learning Methods

Abstract

The purpose of this article is to present a novel learning paradigm that extracts a reward-related low-dimensional state space by combining correlation-based learning, such as Input Correlation Learning (ICO learning), with reward-based learning, such as Reinforcement Learning (RL). Since ICO learning can quickly find a correlation between a state and an unwanted condition (e.g., failure), we use it to extract a low-dimensional feature space in which a failure-avoidance policy can be found. The extracted feature space is then used as a prior for RL. If a proper feature space can be extracted for a given task, the policy model can be kept simple and the policy can be improved easily. The performance of this learning paradigm is evaluated through simulation of a cart-pole system. The results show that the proposed method enhances the feature extraction process so that a proper feature space for a pole-balancing policy is found. That is, it allows a policy to stabilize the pole effectively over a larger domain of initial conditions than using ICO learning alone or using RL alone without any prior knowledge.
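As a rough, non-authoritative illustration of the paradigm the abstract describes, the sketch below runs the standard input-correlation (ICO) weight update, dw_i ∝ u_i · du_0/dt, on a simplified cart-pole: the reflex signal u_0 fires when the pole nears failure, the learned weights w indicate which state variables correlate with failure, and the projection w·s is the kind of low-dimensional feature a downstream RL policy would act on. This is a simplified reading of the abstract, not the authors' implementation; the dynamics constants, reflex threshold, learning rate, and bang-bang controller are illustrative assumptions.

```python
import numpy as np

GRAVITY, CART_M, POLE_M, POLE_L, DT = 9.8, 1.0, 0.1, 0.5, 0.02

def step_cart_pole(s, force):
    """One Euler step of the classic cart-pole dynamics; s = [x, x_dot, theta, theta_dot]."""
    x, x_dot, th, th_dot = s
    total_m = CART_M + POLE_M
    tmp = (force + POLE_M * POLE_L * th_dot ** 2 * np.sin(th)) / total_m
    th_acc = (GRAVITY * np.sin(th) - np.cos(th) * tmp) / (
        POLE_L * (4.0 / 3.0 - POLE_M * np.cos(th) ** 2 / total_m))
    x_acc = tmp - POLE_M * POLE_L * th_acc * np.cos(th) / total_m
    return s + DT * np.array([x_dot, x_acc, th_dot, th_acc])

def ico_update(w, u, du0, mu=0.05):
    """ICO learning: each weight changes in proportion to its input times the
    time derivative of the reflex (failure) signal u0."""
    return w + mu * u * du0

def run_ico(episodes=50, steps=500):
    w = np.zeros(4)                                # one weight per state variable
    for _ in range(episodes):
        s = np.array([0.0, 0.0, 0.05, 0.0])        # small initial pole angle
        u0_prev = 0.0
        for _ in range(steps):
            u0 = 1.0 if abs(s[2]) > 0.2 else 0.0   # reflex fires when the pole nears failure
            w = ico_update(w, s, u0 - u0_prev)     # correlate state with d(u0)/dt
            u0_prev = u0
            feature = w @ s                        # reward-related 1-D feature
            force = 10.0 if feature > 0 else -10.0 # crude bang-bang controller on the feature
            s = step_cart_pole(s, force)
            if abs(s[2]) > 0.5 or abs(s[0]) > 2.4: # hard failure ends the episode
                break
    return w                                       # large |w_i| marks failure-relevant state variables

if __name__ == "__main__":
    w = run_ico()
    print("learned ICO weights:", w)               # w @ s would serve as the low-dimensional input to RL
```

In the paper the extracted feature space is handed to RL as a prior; in this sketch that stage is only indicated by the final projection.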
