首页> 外文会议>International Conference on Development and Learning and Epigenetic Robotics >Looking Back and Ahead: Adaptation and Planning by Gradient Descent
【24h】

Looking Back and Ahead: Adaptation and Planning by Gradient Descent

机译:回顾和前进:通过梯度下降调整和规划

获取原文

摘要

Adaptation and planning are crucial for both biological and artificial agents. In this study, we treat these as an inference problem that we solve using a gradient-based optimization approach. We propose adaptation and planning by gradient descent (APGraDe), a gradient-based computational framework with a hierarchical recurrent neural network (RNN) for adaptation and planning. This framework computes (counterfactual) prediction errors by looking back on past situations based on actual observations and by looking ahead to future situations based on preferred observations (or goal). The internal state of the higher level of the RNN is optimized in the direction of minimizing these errors. The errors for the past contribute to the adaptation while errors for the future contribute to the planning. The proposed APGraDe framework is implemented in a humanoid robot and the robot performs a ball manipulation task with a human experimenter. Experimental results show that given a particular preference, the robot can adapt to unexpected situations while pursuing its own preference through the planning of future actions.
机译:适应和规划对于生物和人工剂至关重要。在这项研究中,我们将这些视为使用基于梯度的优化方法解决的推理问题。我们通过梯度下降(APGRADE),基于梯度的计算框架提出适应和规划,具有用于适应和规划的分层经常性神经网络(RNN)。该框架通过基于实际观察和基于首选观察(或目标)来向未来的情况展望未来的情况,通过回顾过去的情况来计算(反事实)预测错误。在最小化这些误差的方向上优化RNN的更高级别的内部状态。过去的错误有助于适应,而未来的错误有助于规划。所提出的APGRADE框架是在人形机器人中实现的,并且机器人通过人类实验者执行球操纵任务。实验结果表明,考虑到特定的偏好,机器人可以通过规划未来的行动来追求自己的偏好,适应意外情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号