首页> 外文期刊>Knowledge-Based Systems >Reinforcement learning in learning automata and cellular learning automata via multiple reinforcement signals
【24h】

Reinforcement learning in learning automata and cellular learning automata via multiple reinforcement signals

机译:通过多个增强信号学习自动机和细胞学习自动机中的增强学习

获取原文
获取原文并翻译 | 示例
       

摘要

Many scientific and engineering problems are decentralized in nature. Various distributed approaches have been developed for solving these problems, and among them, cellular learning automaton has demonstrated to be an effective model for systems consisting of a large number of interacting components. In the cellular learning automata approach, each such component is modeled by a learning automaton. The learning automaton associated with a component aims to learn the action which best suites with its neighboring components. This objective becomes more challenging when the automaton is required to find the optimal subset of its available actions. The common learning automata algorithms can deal with this problem by considering all combinations of their allowable actions as new action sets. However, this approach is only applicable for small action spaces. The current work extends some common learning automata algorithms so that they can efficiently learn the optimal subset of their actions through parallel reinforcements. These parallel reinforcements represent the favorability of each action in the performed subset of actions; consequently, the learning automaton would be able to learn the effectiveness of each action individually. By integrating the new LA models in a cellular learning automaton, each component of the system is able to interact with its neighbors simultaneously via multiple actions. In order to investigate the effectiveness of the proposed models, their applicability on a channel assignment problem is investigated experimentally. The achieved results demonstrate the efficiency of the proposed multi-reinforcement learning schemes. (C) 2019 Elsevier B.V. All rights reserved.
机译:本质上,许多科学和工程问题都是分散的。已经开发出各种分布式方法来解决这些问题,并且其中,蜂窝学习自动机已被证明是由大量交互组件组成的系统的有效模型。在细胞学习自动机方法中,每个这样的组件都由学习自动机建模。与组件关联的学习自动机旨在学习最适合其相邻组件的动作。当需要自动机查找其可用操作的最佳子集时,此目标变得更具挑战性。通用学习自动机算法可以通过将其允许动作的所有组合视为新动作集来解决此问题。但是,此方法仅适用于小型活动空间。当前的工作扩展了一些常见的学习自动机算法,以便他们可以通过并行增强有效地学习其动作的最佳子集。这些平行的增强表示每个动作在执行的动作子集中的可取性。因此,学习自动机将能够单独学习每个动作的有效性。通过将新的LA模型集成到细胞学习自动机中,系统的每个组件都可以通过多个动作同时与其邻居进行交互。为了研究所提出模型的有效性,通过实验研究了它们在信道分配问题上的适用性。所获得的结果证明了所提出的多强化学习方案的效率。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号