Extending Environments to Measure Self-reflection in Reinforcement Learning


Abstract

We consider an extended notion of reinforcement learning in which the environment can simulate the agent and base its outputs on the agent’s hypothetical behavior. Since good performance usually requires paying attention to whatever the environment’s outputs are based on, we argue that for an agent to achieve good performance on average across many such extended environments, the agent must self-reflect. Thus weighted-average performance over the space of all suitably well-behaved extended environments could be considered a way of measuring how self-reflective an agent is. We give examples of extended environments and introduce a simple transformation which experimentally seems to increase some standard RL agents’ performance in a certain type of extended environment.
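As a minimal sketch of what "the environment can simulate the agent" might look like in code, the following Python toy passes the agent itself to the reward function, which replays the agent on a history with every reward set to zero and rewards agreement with that hypothetical action. The environment, the names `extended_reward`, `run`, `constant_agent`, and `reward_chaser`, and the +1/-1 reward scheme are all illustrative assumptions and are not taken from the source.

```python
from typing import Callable, List, Tuple

Step = Tuple[float, int]                    # (reward received, action then taken)
Agent = Callable[[List[Step], float], int]  # (past steps, latest reward) -> action in {0, 1}


def extended_reward(agent: Agent, past: List[Step], action: int) -> float:
    """Hypothetical extended environment: simulate the agent on a copy of the
    history with all rewards zeroed, and reward agreement with that simulation."""
    zeroed = [(0.0, a) for _, a in past]
    hypothetical_action = agent(zeroed, 0.0)
    return 1.0 if action == hypothetical_action else -1.0


def run(agent: Agent, steps: int) -> float:
    """Run the agent for `steps` turns and return its total reward."""
    past: List[Step] = []
    reward, total = 0.0, 0.0
    for _ in range(steps):
        action = agent(past, reward)
        next_reward = extended_reward(agent, past, action)
        past.append((reward, action))
        reward = next_reward
        total += reward
    return total


def constant_agent(past: List[Step], reward: float) -> int:
    # Ignores its inputs, so it always matches its own simulation.
    return 0


def reward_chaser(past: List[Step], reward: float) -> int:
    # Repeats its last action after a positive reward, otherwise switches.
    if not past:
        return 0
    last_action = past[-1][1]
    return last_action if reward > 0 else 1 - last_action


if __name__ == "__main__":
    print(run(constant_agent, 10))
    print(run(reward_chaser, 10))
```

Over 10 steps in this toy setup, `constant_agent` collects a total of 10.0, while `reward_chaser` ends at 0.0 because reacting to the real rewards makes it diverge from its zero-reward simulation on every other step. An agent would have to account for how its own behavior is being simulated to do well here, which is the kind of dependence the abstract describes.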


Bibliographic details

Source: Journal of Artificial General Intelligence, 2022, Issue 1, pp. 1-24 (24 pages)
Author affiliations: The U.S. Securities and Exchange Commission; KX; InQTel
Original format: PDF
Language: English

