Deep Feedback Learning

Abstract

An agent acting in an environment aims to minimise uncertainty, so that being attacked can be predicted and rewards are not found only by chance. These events define an error signal that can be used to improve performance. In this paper we present a new algorithm in which an error signal from a reflex trains a novel deep network: the error is propagated forwards through the network, from its input to its output, in order to generate pro-active actions. We demonstrate the algorithm in two scenarios, a first-person shooter game and a driving-car scenario; in both cases the network develops strategies to become pro-active.
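The abstract's key idea, propagating a reflex-generated error forwards through the network rather than backwards, can be sketched as follows. This is a minimal illustration, not the paper's actual algorithm: the layer structure, tanh activation, and Hebbian-style update rule (correlating each layer's stored input with the forward-travelling error) are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

class ForwardErrorLayer:
    """A layer that receives an error at its input side and passes it on
    towards the output, updating its weights along the way (illustrative)."""

    def __init__(self, n_in, n_out, lr=0.01):
        self.W = rng.normal(scale=0.1, size=(n_out, n_in))
        self.lr = lr
        self.x = np.zeros(n_in)

    def forward(self, x):
        # Ordinary forward pass of the activity signal.
        self.x = x
        return np.tanh(self.W @ x)

    def propagate_error(self, err):
        # The error travels in the same direction as the activity:
        # input -> output, through the current weights.
        err_out = self.W @ err
        # Hebbian-style update: correlate the forward-propagated error
        # with the input this layer saw (assumed learning rule).
        self.W += self.lr * np.outer(err_out, self.x)
        return err_out

# A tiny two-layer network driven by a reflex error signal.
layers = [ForwardErrorLayer(4, 8), ForwardErrorLayer(8, 2)]

x = rng.normal(size=4)          # sensory input
for layer in layers:
    x = layer.forward(x)
action = x                      # pro-active motor output

reflex_error = rng.normal(size=4)   # e.g. an obstacle-triggered reflex
e = reflex_error
for layer in layers:
    e = layer.propagate_error(e)    # error flows input -> output
```

Note the contrast with backpropagation: here no gradient is sent backwards from the output; the reflex error enters where the sensory input enters and drives the weight changes on its way forward.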
