首页> 外文期刊>Neural computing & applications >A novel approach to accelerate the convergence speed of a stochastic multi-agent system using recurrent neural nets
【24h】

A novel approach to accelerate the convergence speed of a stochastic multi-agent system using recurrent neural nets

机译:利用递归神经网络加快随机多智能体系统收敛速度的新方法

获取原文
获取原文并翻译 | 示例
           

摘要

One problem in the design of multi-agent systems is the difficulty of predicting the occurrences that one agent might face, also to recognize and to predict their optimum behavior in these situations. Therefore, one of the most important characteristic of the agent is their ability during adoption, to learn, and correct their behavior. With consideration of the continuously changing environment, the back and forth learning of the agents, the inability to see the agent's action first hand, and their chosen strategies, learning in a multi-agent environment can be very complex. On the one hand, with recognition to the current learning models that are used in deterministic environment that behaves linearly, which contain weaknesses; therefore, the current learning models are unproductive in complex environments that the actions of agents are stochastic. Therefore, it is necessary for the creation of learning models that are effective in stochastic environments. Purpose of this research is the creation of such a learning model. For this reason, the Hopfield and Boltzmann learning algorithms are used. In order to demonstrate the performance of their algorithms, first, an unlearned multi-agent model is created. During the interactions of the agents, they try to increase their knowledge to reach a specific value. The predicated index is the number of changed states needed to reach the convergence. Then, the learned multi-agent model is created with the Hopfield learning algorithm, and in the end, the learned multi-agent model is created with the Boltzmann learning algorithm. After analyzing the obtained figures, a conclusion can be made that when learning impose to multi-agent environment the average number of changed states needed to reach the convergence decreased and the use of Boltzmann learning algorithm decreased the average number of changed states even further in comparison with Hopfield learning algorithm due to the increase in the number of choices in each situation. Therefore, it is possible to say that the multi-agent systems behave stochastically, the more closer they behave to their true character, the speed of reaching the global solution increases.
机译:多智能体系统设计中的一个问题是难以预测一个智能体可能面对的事件,也难以识别和预测这些情况下的最佳行为。因此,代理的最重要特征之一就是他们在收养过程中学习,纠正其行为的能力。考虑到不断变化的环境,代理的来回学习,无法直接看到代理的行为及其选择的策略,在多代理环境中进行学习可能会非常复杂。一方面,认识到当前在确定性环境中使用的线性学习行为中存在的学习模型;因此,当前的学习模型在代理的行为是随机的复杂环境中是无用的。因此,有必要创建在随机环境中有效的学习模型。这项研究的目的是创建这种学习模型。因此,使用了Hopfield和Boltzmann学习算法。为了演示其算法的性能,首先,创建了一个未经学习的多主体模型。在代理的交互过程中,他们尝试增加其知识以达到特定的价值。谓词索引是达到收敛所需的已更改状态的数量。然后,使用Hopfield学习算法创建学习的多主体模型,最后使用玻尔兹曼学习算法创建学习的多主体模型。在对获得的数据进行分析之后,可以得出结论,当在多智能体环境中进行学习时,达到收敛所需的平均已改变状态数会减少,并且与之相比,使用Boltzmann学习算法还会进一步减少平均已改变状态数使用Hopfield学习算法的原因是每种情况下选择的数量都会增加。因此,可以说多智能体系统的行为是随机的,它们越接近其真实特征,则达到全局解的速度就会增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号