Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning

Takashi Kuremoto; Koichi Hashiguchi; Keita Morisaki; Shun Watanabe; Kunikazu Kobayashi; Shingo Mabu; Masanao Obayashi

首页> 中文期刊>软件工程与应用（英文） >Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning

Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes how to learn and generate multiple action sequences of a humanoid robot. At first, all the basic action sequences, also called primitive behaviors, are learned by a recurrent neural network with parametric bias (RNNPB) and the value of the internal nodes which are parametric bias (PB) determining the output with different primitive behaviors are obtained. The training of the RNN uses back propagation through time (BPTT) method. After that, to generate the learned behaviors, or a more complex behavior which is the combination of the primitive behaviors, a reinforcement learning algorithm: Q-learning (QL) is adopt to determine which PB value is adaptive for the generation. Finally, using a real humanoid robot, the proposed method was confirmed its effectiveness by the results of experiment.

著录项

来源
《软件工程与应用（英文）》|2012年第12期|128-133|共6页
作者
Takashi Kuremoto; Koichi Hashiguchi; Keita Morisaki; Shun Watanabe; Kunikazu Kobayashi; Shingo Mabu; Masanao Obayashi;
展开▼
作者单位

Graduate School of Science and Engineering, Yamaguchi University, Ube, Yamaguchi, Japan;

School of Information Science and Technology, Aichi Prefectural University, Nagakute, Aichi, Japan;

展开▼
原文格式 PDF
正文语种 chi
中图分类肿瘤学;
关键词
RNNPB; Humanoid; robot; BPTT; reinforcement; learning; multiple; action; sequences;

相似文献

中文文献
外文文献

Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅