首页> 外文会议>IEEE International Conference on Intelligence Engineering Systems >Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning
【24h】

Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning

机译:基于强化学习的配给算法较少使用动作的研究

获取原文

摘要

Programming by demonstration is an interesting subject in the field of robotics and it is developing more and more in the direction of robots for services and humanoid robots. Programming by demonstration is much less researched when it comes to industrial robots. One of the reasons is that an industrial robot has to act in a precise and certain manner. However, extending research regarding programming by demonstration to the field of industrial robots could lead to the creation of intelligent systems where the industrial robot could be programmed in an easier way. The goal of our research is to develop an intelligent system useful for industrial robot programming by demonstration. The reasoning algorithms are the mechanisms which offer flexibility to the proposed system. We have focused our research on the creation of a reasoning algorithm based on artificial neural networks [1, 2]. Because the results of this algorithm were not satisfying we have switched our focus to the development of a reasoning algorithm based on reinforcement learning [3]. The algorithm is based on the idea that marks can be assigned to each possible action whenever the robot is in an unknown state. The exploration of less-used actions plays also an important role in the case the robot must to take a decision. Based on the marks and on the exploration feature of the algorithm the robot updates its behaviour. This paper presents a description and some studies on less-used actions exploration problem of the algorithm. Some chapters of the paper will deal with the problems implementing the algorithm, the conducted experiments in terms of exploration feature of the algorithm and the results obtained. The analysis of the results and the characteristics of the algorithm in terms of less-used actions exploration are also discussed in this paper.
机译:通过演示编程是机器人领域的一个有趣的主题,它在机器人的方向上发展了服务和人形机器人的方向。通过演示编程在工业机器人方面的研发得多。其中一个原因是工业机器人必须以准确和某种方式行事。然而,通过对工业机器人领域的示范进行编程的扩展研究可能导致创建工业机器人可以以更轻松的方式编程的智能系统。我们的研究的目标是开发一个可用于通过示范的工业机器人编程的智能系统。推理算法是为所提出的系统提供灵活性的机制。我们专注于基于人工神经网络的推理算法的研究重点[1,2]。由于该算法的结果不满足,我们已经将我们的重点转换为基于加强学习的推理算法的开发[3]。该算法基于当机器人处于未知状态时,可以将标记分配给每个可能的动作的想法。较少使用的行动的探索在机器人必须做出决定的情况下也发挥着重要作用。基于标记和算法的探索特征,机器人更新其行为。本文介绍了对算法较少使用的探索问题的描述和一些研究。本文的一些章节将处理实施算法的问题,在算法的勘探特征方面进行了对实验和获得的结果。本文还讨论了对结果的分析和算法的特征,并在本文中讨论了较少使用的动作探索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号