Graduate School of Science and Engineering, Yamaguchi University, Ube, Yamaguchi, Japan;
School of Information Science and Technology, Aichi Prefectural University, Nagakute, Aichi, Japan;
RNNPB; Humanoid robot; BPTT; reinforcement learning; multiple action sequences