...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms
【24h】

A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

机译:设计用于学习自动机的增强方案的新方法:随机估计器学习算法

获取原文
获取原文并翻译 | 示例
           

摘要

A new class of learning automata is introduced. The new automata use a stochastic estimator and are able to operate in nonstationary environments with high accuracy and a high adaptation rate. According to the stochastic estimator scheme, the estimates of the mean rewards of actions are computed stochastically. So, they are not strictly dependent on the environmental responses. The dependence between the stochastic estimates and the deterministic estimator's contents is more relaxed when the latter are old and probably invalid. In this way, actions that have not been selected recently have the opportunity to be estimated as "optimal", to increase their choice probability, and, consequently, to be selected. Thus, the estimator is always recently updated and consequently is able to be adapted to environmental changes. The performance of the Stochastic Estimator Learning Automaton (SELA) is superior to the previous well-known S-model ergodic schemes. Furthermore, it is proved that SELA is absolutely expedient in every stationary S-model random environment.
机译:引入了一类新的学习自动机。新的自动机使用随机估计器,并且能够在非平稳环境中以高精度和高自适应率运行。根据随机估计器方案,将随机计算操作的平均回报估计值。因此,它们并不严格依赖于环境响应。当确定性估计量过时并且可能无效时,随机估计量和确定性估计量的内容之间的依赖性会更加宽松。以此方式,最近未被选择的动作有机会被估计为“最佳”,以增加其选择概率,并因此被选择。因此,估计器总是最近更新,因此能够适应环境变化。随机估计器学习自动机(SELA)的性能优于以前众所周知的S模型遍历方案。此外,事实证明,SELA在每个平稳的S模型随机环境中都是绝对有利的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号