Restraining Bolts for Reinforcement Learning Agents

Abstract

In this work we investigate the concept of a "restraining bolt", inspired by science fiction. We consider two distinct sets of features extracted from the world: one by the agent, and one by the authority imposing restraining specifications on the agent's behaviour (the "restraining bolt"). The two sets of features, and hence the models of the world attainable from them, are apparently unrelated, since they are of interest to independent parties. However, they both account for (aspects of) the same world. We consider the case in which the agent is a reinforcement learning agent acting on a set of low-level (subsymbolic) features, while the restraining bolt is specified logically, using linear-time logics on finite traces (LTL_f/LDL_f), over a set of high-level symbolic features. We show formally, and illustrate with examples, that under general circumstances the agent can learn while shaping its goals to conform as much as possible to the restraining bolt specifications.

Bibliographic details

Source
AAAI Conference on Artificial Intelligence; AAAI Symposium on Educational Advances in Artificial Intelligence | 2020 | pp. 13350-13707 | 4 pages
Authors
Giuseppe De Giacomo; Luca Iocchi; Marco Favorito; Fabio Patrizi
Original format: PDF
Chinese Library Classification: TP18-53

Similar literature

1. Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent's learning process in multiagent environments [J]. H. S. Al-Dayaa, D. B. Megherbi. The Journal of Supercomputing. 2012, No. 1
2. Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent's learning process in multiagent environments [J]. H. S. Al-Dayaa, D. B. Megherbi. Journal of Supercomputing. 2012, No. 1
3. A multi-agent reinforcement learning method with learning of other agents for competitive game [J]. Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda. 電子情報通信学会技術研究報告. ニューロコンピューティング (Neurocomputing). 2000, No. 688
4. Restraining Bolts for Reinforcement Learning Agents [C]. Giuseppe De Giacomo, Luca Iocchi, Marco Favorito. AAAI Conference on Artificial Intelligence; AAAI Symposium on Educational Advances in Artificial Intelligence. 2020
5. Hybrid learning approach based on adaptive resonance theory and reinforcement learning for computer generated agents [D]. Susumu Ninomiya. 2002
6. Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents [O]. Jeongho Park, Juwon Lee, Taehwan Kim. 2021
7. Learning from Learners: Adapting Reinforcement Learning Agents to be Competitive in a Card Game [O]. Pablo Barros, Ana Tanevska, Alessandra Sciutti. 2021
