Looking Back and Ahead: Adaptation and Planning by Gradient Descent

Abstract

Adaptation and planning are crucial for both biological and artificial agents. In this study, we treat these as an inference problem that we solve using a gradient-based optimization approach. We propose adaptation and planning by gradient descent (APGraDe), a gradient-based computational framework with a hierarchical recurrent neural network (RNN) for adaptation and planning. This framework computes (counterfactual) prediction errors by looking back on past situations based on actual observations and by looking ahead to future situations based on preferred observations (or goal). The internal state of the higher level of the RNN is optimized in the direction of minimizing these errors. The errors for the past contribute to the adaptation while errors for the future contribute to the planning. The proposed APGraDe framework is implemented in a humanoid robot and the robot performs a ball manipulation task with a human experimenter. Experimental results show that given a particular preference, the robot can adapt to unexpected situations while pursuing its own preference through the planning of future actions.
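
The idea in the abstract, optimizing only a higher-level internal state so that errors over a past window and a future window both shrink, can be illustrated with a minimal sketch. This is not the authors' implementation: the tiny single-layer RNN, its random fixed weights, the sizes `H`, `D`, `Z`, the toy observations, and the finite-difference gradient with step halving are all assumptions made for illustration (a real system would typically use backpropagation through time on a trained hierarchical RNN).

```python
import numpy as np

rng = np.random.default_rng(0)
H, D, Z = 8, 2, 3  # hidden, observation, and higher-level state sizes (hypothetical)
W = rng.normal(scale=0.5, size=(H, H))  # fixed recurrent weights (stand-in for a trained RNN)
U = rng.normal(scale=0.5, size=(H, Z))  # how the higher-level state z drives the lower level
V = rng.normal(scale=0.5, size=(D, H))  # readout to predicted observations


def rollout(z, steps):
    """Predict `steps` observations from a constant higher-level state z."""
    h = np.zeros(H)
    preds = []
    for _ in range(steps):
        h = np.tanh(W @ h + U @ z)
        preds.append(V @ h)
    return np.array(preds)


def total_error(z, past_obs, future_goal):
    """Past error (vs. actual observations) plus future error (vs. preferred ones)."""
    n_past = len(past_obs)
    preds = rollout(z, n_past + len(future_goal))
    past_err = np.sum((preds[:n_past] - past_obs) ** 2)       # "looking back"
    future_err = np.sum((preds[n_past:] - future_goal) ** 2)  # "looking ahead"
    return past_err + future_err


def numerical_grad(f, z, eps=1e-5):
    """Central-difference gradient of f at z (used here instead of backpropagation)."""
    grad = np.zeros_like(z)
    for i in range(z.size):
        step = np.zeros_like(z)
        step[i] = eps
        grad[i] = (f(z + step) - f(z - step)) / (2 * eps)
    return grad


def optimize_state(z0, past_obs, future_goal, iters=200, lr=0.1):
    """Gradient descent on z only; the network weights stay fixed."""
    def f(z):
        return total_error(z, past_obs, future_goal)

    z, loss = z0.copy(), f(z0)
    for _ in range(iters):
        g = numerical_grad(f, z)
        step = lr
        # Halve the step until the error does not increase (simple backtracking).
        while step > 1e-8:
            candidate = z - step * g
            cand_loss = f(candidate)
            if cand_loss <= loss:
                z, loss = candidate, cand_loss
                break
            step /= 2
    return z, loss


past_obs = rollout(rng.normal(size=Z), 5)   # toy "actual" observations
future_goal = np.tile([0.5, -0.5], (5, 1))  # toy preferred observations (goal)
z0 = np.zeros(Z)
initial = total_error(z0, past_obs, future_goal)
z_opt, final = optimize_state(z0, past_obs, future_goal)
print(f"error before: {initial:.4f}, after: {final:.4f}")
```

In this sketch the past term pulls `z` toward an explanation of what was actually observed (adaptation), while the future term pulls it toward predictions that match the preferred observations (planning); both share the same state, so a single descent step trades them off.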

Bibliographic details

Source
International Conference on Development and Learning and Epigenetic Robotics | 2019 | 340p | 6 pages in total
Authors
Shingo Murata; Hiroki Sawa; Shigeki Sugano; Tetsuya Ogata
Original format: PDF
Chinese Library Classification: Robotics
Keywords
Planning; Robots; Optimization; Microsoft Windows; Task analysis; Erbium; Recurrent neural networks

Similar literature

1. Application of a Gradient Descent Continuous Actor-Critic Algorithm for Double-Side Day-Ahead Electricity Market Modeling [J]. Huiru Zhao, Yuwei Wang, Sen Guo, Energies. 2016, Issue 9
2. Gradient descent optimization for visual tracking with geometrics transformation adaptation [J]. Younes Dhassi, Samah Elkah, Abdellah Aarab, Procedia Computer Science. 2019, Issue 22
3. Gradient descent optimization for visual tracking with geometrics transformation adaptation [J]. Younes Dhassi, Samah Elkah, Abdellah Aarab, Procedia Computer Science. 2019, Issue 11
4. Looking Back and Ahead: Adaptation and Planning by Gradient Descent [C]. Shingo Murata, Hiroki Sawa, Shigeki Sugano, International Conference on Development and Learning and Epigenetic Robotics. 2019
5. Physiologically-based vision modeling applications and gradient descent-based parameter adaptation of pulse coupled neural networks [D]. Broussard, Randy Paul. 1997
6. Some behavioral aspects of energy descent: how a biophysical psychology might help people transition through the lean times ahead [O]. Raymond De Young. 2014
7. Application of a Gradient Descent Continuous Actor-Critic Algorithm for Double-Side Day-Ahead Electricity Market Modeling [O]. Huiru Zhao, Yuwei Wang, Sen Guo. 2016
8. Physiologically-Based Vision Modeling Applications and Gradient Descent-Based Parameter Adaptation of Pulse Coupled Neural Networks [R]. Broussard, R. P. 1997
