Two forms of immediate reward reinforcement learning for exploratory data analysis.

Wu Y; Fyfe C; Lai PL

首页> 外文期刊>Neural Networks: The Official Journal of the International Neural Network Society >Two forms of immediate reward reinforcement learning for exploratory data analysis.

【24h】

Two forms of immediate reward reinforcement learning for exploratory data analysis.

机译：两种形式的即时奖励强化学习用于探索性数据分析。

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We review two forms of immediate reward reinforcement learning: in the first of these, the learner is a stochastic node while in the second the individual unit is deterministic but has stochastic synapses. We illustrate the first method on the problem of Independent Component Analysis. Four learning rules have been developed from the second perspective and we investigate the use of these learning rules to perform linear projection techniques such as principal component analysis, exploratory projection pursuit and canonical correlation analysis. The method is very general and simply requires a reward function which is specific to the function we require the unit to perform. We also discuss how the method can be used to learn kernel mappings and conclude by illustrating its use on a topology preserving mapping.

机译：我们回顾了两种形式的即时奖励强化学习：在第一种中，学习者是一个随机节点，而在第二种中，单个单元是确定性的但具有随机突触。我们说明了关于独立成分分析问题的第一种方法。从第二个角度已经开发了四个学习规则，我们研究了使用这些学习规则来执行线性投影技术，例如主成分分析，探索性投影追踪和规范相关分析。该方法非常通用，仅需要奖励功能，该功能特定于我们要求单位执行的功能。我们还将讨论如何将该方法用于学习内核映射，并通过说明其在拓扑保留映射上的用法来得出结论。

著录项

来源
《Neural Networks: The Official Journal of the International Neural Network Society》 |2008年第6期|共9页
作者
Wu Y; Fyfe C; Lai PL;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类神经病学;
关键词
Learning; Rewards; Reinforcement (Psychology); Two; physiological aspects; 学习; 强化(心理学);

机译：Learning;Rewards;Reinforcement (Psychology);Two;physiological aspects;学习;强化(心理学);

相似文献

外文文献
中文文献
专利

1. Two forms of immediate reward reinforcement learning for exploratory data analysis. [J] . Wu Y, Fyfe C, Lai PL Neural Networks: The Official Journal of the International Neural Network Society . 2008,第6期

机译：两种形式的即时奖励强化学习用于探索性数据分析。
2. Reinforcement Learning Based Adaptive Sampling: REAPing Rewards by Exploring Protein Conformational Landscapes [J] . Shamsi Zahra, Cheng Kevin J., Shukla Diwakar The journal of physical chemistry, B. Condensed matter, materials, surfaces, interfaces & biophysical . 2018,第35期

机译：基于加强学习的自适应抽样：通过探索蛋白质构象景观来获得奖励
3. Policy invariance under reward transformations for multi-objective reinforcement learning [J] . Mannion Patrick, Devlin Sam, Mason Karl, Neurocomputing . 2017,第nova8期

机译：奖励转换下多目标强化学习的策略不变性
4. Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents [C] . Eiji Uchibe, Kenji Doya International Conference on Neural Information Processing;ICONIP 2007 . 2008

机译：通过网络啮齿动物的典型进化和约束强化学习找到探索性奖励
5. Learning Policies for Model-Based Reinforcement Learning Using Distributed Reward Formulation [D] . Agarwal, Nikhil. 2021

机译：使用分布式奖励制定学习基于模型的强化学习的政策
6. Inferring reward prediction errors in patients with schizophrenia: a dynamic reward task for reinforcement learning [O] . Chia-Tzu Li, Wen-Sung Lai, Chih-Min Liu, 2014

机译：推断精神分裂症患者的奖励预测错误：强化学习的动态奖励任务
7. Framing Reinforcement Learning from Human Reward: Reward Positivity, Temporal Discounting, Episodicity, and Performance [O] . W. Bradley Knox, Peter Stone 2015

机译：从人类奖励中学习强化学习：奖励积极性，时间贴现，情节性和表现
8. Framing Reinforcement Learning from Human Reward: Reward Positivity, Temporal Discounting, Episodicity, and Performance. [R] . Knox, W. B., Stone, P. 2014

机译：从人类奖励中学习强化学习：奖励积极性，时间贴现，情节性和表现。

Two forms of immediate reward reinforcement learning for exploratory data analysis.

摘要

著录项

相似文献

相关主题

期刊订阅