...
首页> 外文期刊>IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics >Generalized pursuit learning schemes: new families of continuous and discretized learning automata
【24h】

Generalized pursuit learning schemes: new families of continuous and discretized learning automata

机译:广义追求学习计划:连续和离散学习自动机的新家族

获取原文
获取原文并翻译 | 示例
           

摘要

The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry (1986). The pioneering work of these authors was the pursuit algorithm, which pursues only the current estimated optimal action. If this action is not the one with the minimum penalty probability, this algorithm pursues a wrong action. In this paper, we argue that a pursuit scheme that generalizes the traditional pursuit algorithm by pursuing all the actions with higher reward estimates than the chosen action, minimizes the probability of pursuing a wrong action, and is a faster converging scheme. To attest this, we present two new generalized pursuit algorithms (GPAs) and also present a quantitative comparison of their performance against the existing pursuit algorithms. Empirically, the algorithms proposed here are among the fastest reported LA to date.
机译:当前可用的最快的学习自动机(LA)算法属于Thathachar和Sastry(1986)引入的估计器算法系列。这些作者的开创性工作是追踪算法,该算法仅追踪当前估计的最佳动作。如果该动作不是惩罚概率最小的动作,则该算法执行错误的动作。在本文中,我们认为一种跟踪方案,通过以比所选择的动作更高的奖励估计来追求所有动作来推广传统的跟踪算法,将追求错误动作的可能性降到最低,并且是一种更快的收敛方案。为了证明这一点,我们提出了两种新的广义追踪算法(GPA),并提出了它们与现有追踪算法的性能的定量比较。根据经验,此处提出的算法是迄今为止报道最快的LA之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号