An Adaptive Actor-critic Algorithm with Multi-step Simulated Experiences for Controlling Nonholonomic Mobile Robots

Rafiuddin Syam; Keigo Watanabe; Kiyotaka Izumi


Abstract

In this paper, we propose a new adaptive actor-critic algorithm with multi-step simulated experiences, as a kind of temporal difference (TD) method. In our approach, the TD-error is composed of two value-functions and m utility functions, where m denotes the number of steps over which the experience is simulated. The value-function is constructed from the critic, formulated as a radial basis function neural network (RBFNN), whose input is a simulated experience generated from a predictive model based on a kinematic model. Since our approach assumes that the model is available both to simulate the m-step experiences and to design a controller, the same kinematic model is also used to construct the actor; the resulting model-based actor (MBA) is likewise regarded as a network, i.e., it is viewed as a resolved velocity control network. We apply this approach to the control of a nonholonomic mobile robot, specifically to a trajectory tracking control problem for the position coordinates and azimuth. Simulations show the effectiveness of the proposed method for controlling a mobile robot with two independent driving wheels.
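The abstract does not give the formulas, so the following is only a minimal sketch of how an m-step TD-error built from two value-function evaluations and m utility terms might be computed. It assumes the standard m-step TD form, a unicycle-type kinematic model for a robot with two independent driving wheels, a Gaussian RBF critic, and a squared tracking error as utility. All function names, the discount factor `gamma`, the time step `dt`, and the utility definition are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def kinematic_step(state, v, omega, dt=0.1):
    """One Euler step of an assumed unicycle kinematic model (x, y, theta)."""
    x, y, theta = state
    return np.array([x + v * np.cos(theta) * dt,
                     y + v * np.sin(theta) * dt,
                     theta + omega * dt])

def tracking_error(state, reference):
    """Pose error (reference minus state), with the azimuth error wrapped to [-pi, pi]."""
    e = np.asarray(reference, dtype=float) - np.asarray(state, dtype=float)
    e[2] = np.arctan2(np.sin(e[2]), np.cos(e[2]))
    return e

def rbf_value(error, centers, widths, weights):
    """Critic sketch: weighted sum of Gaussian radial basis functions of the error."""
    d2 = np.sum((centers - error) ** 2, axis=1)
    phi = np.exp(-d2 / (2.0 * widths ** 2))
    return float(weights @ phi)

def utility(error):
    """Hypothetical utility (cost): squared norm of the tracking error."""
    return float(error @ error)

def m_step_td_error(state, references, controls, centers, widths, weights,
                    gamma=0.95, dt=0.1):
    """Assumed m-step TD-error:
    sum_{k=0}^{m-1} gamma^k U(e_{t+k}) + gamma^m V(e_{t+m}) - V(e_t),
    where the future errors come from simulating the kinematic model.
    `references` holds m+1 reference poses, `controls` holds m (v, omega) pairs.
    """
    s = np.asarray(state, dtype=float)
    m = len(controls)
    v_now = rbf_value(tracking_error(s, references[0]), centers, widths, weights)
    discounted_utilities = 0.0
    for k, (v, omega) in enumerate(controls):
        discounted_utilities += gamma ** k * utility(tracking_error(s, references[k]))
        s = kinematic_step(s, v, omega, dt)  # simulated experience, not a real move
    v_future = rbf_value(tracking_error(s, references[m]), centers, widths, weights)
    return discounted_utilities + gamma ** m * v_future - v_now
```

In a sketch like this, the two value-function terms are the critic evaluated at the current error and at the simulated error m steps ahead, while the m utility terms are accumulated along the simulated rollout. How the paper weights or discounts these terms is not stated in the abstract.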


Bibliographic details

Source
Soft Computing, 2007, No. 1, pp. 81-89 (9 pages)
Authors
Rafiuddin Syam; Keigo Watanabe; Kiyotaka Izumi
Affiliation

Department of Advanced Systems Control Engineering Graduate School of Science and Engineering Saga University 1 Honjomachi Saga 840-8502 Japan;

Original format: PDF
Language: English
Keywords
Actor-critic algorithms; Kinematic model; Multi-step prediction; Nonholonomic mobile robot; Nonlinear predictive model; Simulated experience;


