A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

Papadimitriou G.I.

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

【24h】

A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

机译：设计用于学习自动机的增强方案的新方法：随机估计器学习算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A new class of learning automata is introduced. The new automata use a stochastic estimator and are able to operate in nonstationary environments with high accuracy and a high adaptation rate. According to the stochastic estimator scheme, the estimates of the mean rewards of actions are computed stochastically. So, they are not strictly dependent on the environmental responses. The dependence between the stochastic estimates and the deterministic estimator's contents is more relaxed when the latter are old and probably invalid. In this way, actions that have not been selected recently have the opportunity to be estimated as "optimal", to increase their choice probability, and, consequently, to be selected. Thus, the estimator is always recently updated and consequently is able to be adapted to environmental changes. The performance of the Stochastic Estimator Learning Automaton (SELA) is superior to the previous well-known S-model ergodic schemes. Furthermore, it is proved that SELA is absolutely expedient in every stationary S-model random environment.

机译：引入了一类新的学习自动机。新的自动机使用随机估计器，并且能够在非平稳环境中以高精度和高自适应率运行。根据随机估计器方案，将随机计算操作的平均回报估计值。因此，它们并不严格依赖于环境响应。当确定性估计量过时并且可能无效时，随机估计量和确定性估计量的内容之间的依赖性会更加宽松。以此方式，最近未被选择的动作有机会被估计为“最佳”，以增加其选择概率，并因此被选择。因此，估计器总是最近更新，因此能够适应环境变化。随机估计器学习自动机（SELA）的性能优于以前众所周知的S模型遍历方案。此外，事实证明，SELA在每个平稳的S模型随机环境中都是绝对有利的。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |1994年第4期|P.649-654|共6页
作者
Papadimitriou G.I.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A new approach to the design of reinforcement schemes for learning automata [J] . Thathachar M. A. L., Sastry P. S. Systems, Man and Cybernetics, IEEE Transactions on . 1985,第1期

机译：设计用于学习自动机的强化方案的新方法
2. Sampling algorithms for stochastic graphs: A learning automata approach [J] . Rezvanian Alireza, Meybodi Mohammad Reza Knowledge-Based Systems . 2017,第Jula1期

机译：随机图的采样算法：一种学习自动机方法
3. Opposition-based discrete action reinforcement learning automata algorithm case study: optimal design of a PID controller [J] . FATEMEH MOHSENI POUR, ALI AKBAR GHARAVEISI Turkish Journal of Electrical Engineering and Computer Sciences . 2013,第6期

机译：基于对立的离散动作强化学习自动机算法案例研究：PID控制器的优化设计
4. A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms [C] . Vasilakos, A.V., Papadimitriou, . 1991

机译：设计用于学习自动机的增强方案的新方法：随机估计器学习算法
5. On the convergence of model -free policy iteration algorithms for reinforcement learning: Stochastic approximation under discontinuous mean dynamics. [D] . Williams, John Kevin. 2000

机译：关于用于增强学习的无模型策略迭代算法的收敛：不连续平均动力学下的随机逼近。
6. A novel microaneurysms detection approach based on convolutional neural networks with reinforcement sample learning algorithm [O] . Umit Budak, Abdulkadir Şengür, Yanhui Guo, 2017

机译：基于卷积神经网络的增强样本学习算法的微动脉瘤检测新方法
7. Response threshold models, stochastic learning automata and ant colony optimization-based decentralized self-coordination algorithms for heterogeneous multi-tasks distribution in multi-robot systems [O] . Quiñonez Carrillo Alma Yadira 2012

机译：基于响应阈值模型，随机学习自动机和基于蚁群优化的分散式自协调算法，用于多机器人系统中的异构多任务分配
8. Quantitative Analysis of the Effect of Market Design and Policy Uncertainty on Investment in Electricity Generation: A Reinforcement Learning Approach [R] . Grobman, J. H. 2000

机译：发电投资的不确定性：强化学习方法

A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

摘要

著录项

相似文献

相关主题

期刊订阅