International Conference on Neural Information Processing (ICONIP 2010)

Extraction of Reward-Related Feature Space Using Correlation-Based and Reward-Based Learning Methods

Abstract

The purpose of this article is to present a novel learning paradigm that extracts a reward-related low-dimensional state space by combining correlation-based learning, such as Input Correlation Learning (ICO learning), with reward-based learning, such as Reinforcement Learning (RL). Since ICO learning can quickly find a correlation between a state and an unwanted condition (e.g., failure), we use it to extract a low-dimensional feature space in which a failure avoidance policy can be found. The extracted feature space is then used as a prior for RL. If a proper feature space can be extracted for a given task, the model of the policy can be kept simple and the policy can be improved easily. The performance of this learning paradigm is evaluated through simulation of a cart-pole system. The results show that the proposed method can enhance the feature extraction process and find a proper feature space for a pole balancing policy. That is, it allows a policy to stabilize the pole effectively over the largest domain of initial conditions, compared to using only ICO learning or only RL without any prior knowledge.
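For concreteness, the following is a minimal Python sketch of the paradigm the abstract describes, under assumed details: a discrete-time ICO-style weight update that correlates predictive state inputs with the change of a failure (reflex) signal, and a projection of the raw state onto the learned weights as the low-dimensional feature handed to RL. The function names, learning rate, synthetic cart-pole-like states, and failure criterion are illustrative only and are not taken from the paper.

```python
import numpy as np

def ico_update(w, x, reflex, reflex_prev, mu=0.01):
    """One ICO-style learning step: dw_i ~ mu * x_i * d(reflex)/dt (assumed discrete-time form)."""
    d_reflex = reflex - reflex_prev          # discrete derivative of the failure/reflex signal
    return w + mu * x * d_reflex

def extract_feature(w, x):
    """Project the raw state onto the ICO-learned direction (the low-dimensional feature for RL)."""
    return float(np.dot(w, x))

# Toy usage on synthetic 4-D cart-pole-like states (x, x_dot, theta, theta_dot).
rng = np.random.default_rng(0)
w = np.zeros(4)
reflex_prev = 0.0
for _ in range(1000):
    state = rng.normal(size=4)
    # Assumed failure signal: the reflex fires when the pole angle exceeds a threshold.
    reflex = 1.0 if abs(state[2]) > 1.5 else 0.0
    w = ico_update(w, state, reflex, reflex_prev)
    reflex_prev = reflex

feature = extract_feature(w, rng.normal(size=4))
print("learned projection:", w, "example feature value:", feature)
```

In the abstract's terms, the learned weight vector plays the role of the extracted feature space: a subsequent RL policy would be defined over this scalar (or a few such) feature(s) rather than over the full state, which is what makes the policy model simple and easy to improve.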
