Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning

机译：基于强化学习的配给算法较少使用动作的研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Programming by demonstration is an interesting subject in the field of robotics and it is developing more and more in the direction of robots for services and humanoid robots. Programming by demonstration is much less researched when it comes to industrial robots. One of the reasons is that an industrial robot has to act in a precise and certain manner. However, extending research regarding programming by demonstration to the field of industrial robots could lead to the creation of intelligent systems where the industrial robot could be programmed in an easier way. The goal of our research is to develop an intelligent system useful for industrial robot programming by demonstration. The reasoning algorithms are the mechanisms which offer flexibility to the proposed system. We have focused our research on the creation of a reasoning algorithm based on artificial neural networks [1, 2]. Because the results of this algorithm were not satisfying we have switched our focus to the development of a reasoning algorithm based on reinforcement learning [3]. The algorithm is based on the idea that marks can be assigned to each possible action whenever the robot is in an unknown state. The exploration of less-used actions plays also an important role in the case the robot must to take a decision. Based on the marks and on the exploration feature of the algorithm the robot updates its behaviour. This paper presents a description and some studies on less-used actions exploration problem of the algorithm. Some chapters of the paper will deal with the problems implementing the algorithm, the conducted experiments in terms of exploration feature of the algorithm and the results obtained. The analysis of the results and the characteristics of the algorithm in terms of less-used actions exploration are also discussed in this paper.

机译：通过演示编程是机器人领域的一个有趣的主题，它在机器人的方向上发展了服务和人形机器人的方向。通过演示编程在工业机器人方面的研发得多。其中一个原因是工业机器人必须以准确和某种方式行事。然而，通过对工业机器人领域的示范进行编程的扩展研究可能导致创建工业机器人可以以更轻松的方式编程的智能系统。我们的研究的目标是开发一个可用于通过示范的工业机器人编程的智能系统。推理算法是为所提出的系统提供灵活性的机制。我们专注于基于人工神经网络的推理算法的研究重点[1,2]。由于该算法的结果不满足，我们已经将我们的重点转换为基于加强学习的推理算法的开发[3]。该算法基于当机器人处于未知状态时，可以将标记分配给每个可能的动作的想法。较少使用的行动的探索在机器人必须做出决定的情况下也发挥着重要作用。基于标记和算法的探索特征，机器人更新其行为。本文介绍了对算法较少使用的探索问题的描述和一些研究。本文的一些章节将处理实施算法的问题，在算法的勘探特征方面进行了对实验和获得的结果。本文还讨论了对结果的分析和算法的特征，并在本文中讨论了较少使用的动作探索。

著录项

来源
《IEEE International Conference on Intelligence Engineering Systems》|2011年||共6页
会议地点
作者
Stoica M.; Calangiu G. A.; Sisak F.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
入库时间 2022-08-21 10:20:26

相似文献

外文文献
中文文献
专利

1. Opposition-based discrete action reinforcement learning automata algorithm case study: optimal design of a PID controller [J] . FATEMEH MOHSENI POUR, ALI AKBAR GHARAVEISI Turkish Journal of Electrical Engineering and Computer Sciences . 2013,第6期

机译：基于对立的离散动作强化学习自动机算法案例研究：PID控制器的优化设计
2. Memory-based reinforcement learning algorithm for autonomous exploration in unknown environment [J] . Amir Ramezani Dooraki, Deok Jin Lee International Journal of Advanced Robotic Systems . 2018,第3期

机译：未知环境下自主探索的基于记忆的强化学习算法
3. Corticostriatal circuit mechanisms of value-based action selection: Implementation of reinforcement learning algorithms and beyond [J] . Morita Kenji, Jitsev Jenia, Morrison Abigail Behavioural Brain Research: An International Journal . 2016,第Null期

机译：基于价值的动作选择的皮质口电路机制：强化学习算法的实现及其他
4. Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning [C] . Stoica M., Calangiu G. A., Sisak F. 15th International conference on intelligent engineering systems . 2011

机译：基于强化学习的配给算法的少用动作探索问题研究
5. A New Reinforcement Learning Algorithm with Fixed Exploration for Semi-Markov Decision Processes [D] . Encapera, Angelo Michael. 2017

机译：半马尔可夫决策过程的固定探索新强化学习算法
6. Mobile robots exploration through cnn-based reinforcement learning [O] . Lei Tai, Ming Liu -1

机译：通过基于CNN的强化学习探索移动机器人
7. The Effect of Representation and Knowledge on Goal-Directed Exploration with Reinforcement-Learning Algorithms [O] . Sven Koenig, Reid G. Simmons, P. Kaelbling 1996

机译：强化学习算法在表征和知识对目标导向探索中的作用

Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅