Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer

机译：来自多个教师的理论上接地的政策建议，并在加强学习环境中，应用于负转移

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Policy advice is a transfer learning method where a student agent is able to learn faster via advice from a teacher. However, both this and other reinforcement learning transfer methods have little theoretical analysis. This paper formally defines a setting where multiple teacher agents can provide advice to a student and introduces an algorithm to leverage both autonomous exploration and teacher's advice. Our regret bounds justify the intuition that good teachers help while bad teachers hurt. Using our formalization, we are also able to quantify, for the first time, when negative transfer can occur within such a reinforcement learning setting.

机译：政策建议是一项转移学习方法，学生代理能够通过教师的建议更快地学习。然而，这两个和其他加强学习转移方法都几乎没有理论分析。本文正式定义了多个教师代理商可以向学生提供建议并介绍一种算法，以利用自主探索和教师的建议。我们的遗憾界定了良好的教师帮助的直觉，而糟糕的教师受伤。使用我们的形式化，我们还能够在这种加强学习环境中发生负转移时第一次量化。

著录项

来源
《International Joint Conference on Artificial Intelligence》|2016年|1816-2738p|共7页
会议地点
作者
Yusen Zhan; Haitham Bou Ammar; Matthew E. Taylor;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Reinforcement learning of motor skills using Policy Search and human corrective advice [J] . The International journal of robotics research . 2019,第14期

机译：使用策略搜索和人工纠正建议加强运动技能的学习
2. Learning adversarial policy in multiple scenes environment via multi-agent reinforcement learning [J] . Li Yang, Wang Xinzhi, Wang Wei, Connection Science . 2021,第3期

机译：通过多功能钢筋学习在多个场景环境中学习对抗性政策
3. Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning [J] . Saha Tulika, Gupta Dhawal, Saha Sriparna, Expert Systems with Application . 2020,第Deca期

机译：利用分层深度加强学习对多个域和意图的综合对话政策学习
4. Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer [C] . Yusen Zhan, Haitham Bou Ammar, Matthew E. Taylor International Joint Conference on Artificial Intelligence . 2016

机译：来自多个教师的理论上接地的政策建议，并在加强学习环境中，应用于负转移
5. Policy advice, non-convex and distributed optimization in reinforcement learning [D] . Zhan, Yusen. 2016

机译：强化学习中的政策建议，非凸和分布式优化
6. Federated Reinforcement Learning for Training Control Policies on Multiple IoT Devices [O] . Hyun-Kyo Lim, Ju-Bong Kim, Joo-Seong Heo, 2020

机译：联合强化学习用于在多个IoT设备上训练控制策略
7. Children’s Perceptions of Tests: A Content Analysis Gokce Bulgan 10.12973/eu-jer.7.2.159 Pages: 159-167 The Relationships between Quality of Work Life, School Alienation, Burnout, Affective Commitment and Organizational Citizenship: A Study on Teachers Huseyin Akar 10.12973/eu-jer.7.2.169 Pages: 169-180 Determination of Teacher Candidates’ Views Concerning V Diagrams Used in General Biology Laboratories Kadriye Kayacan 10.12973/eu-jer.7.2.181 Pages: 181-187 Use of Instructional Technologies by Teachers in the Educational Process: Metaphor Analysis Study Hakan Sarac 10.12973/eu-jer.7.2.189 Pages: 189-202 The Analysis of Kutadgu Bilig in Terms of Values Education Aysegul Tural 10.12973/eu-jer.7.2.203 Pages: 203-209 Investigating the Resilience Levels of Parents with Children with Multiple Disabilities Based on Different Variables Sinem Kadi, Muzeyyen Eldeniz Cetin 10.12973/eu-jer.7.2.211 Pages: 211-223 Examination of Postgraduate Theses on History Textbooks in Turkey in Terms of Some Variables Eray Alaca 10.12973/eu-jer.7.2.225 Pages: 225-232 Influences of Technology Integrated Professional Development Course on Mathematics Teachers Umit Kul 10.12973/eu-jer.7.2.233 Pages: 233-243 The Book of My Dreams Hatice Degirmenci Gundogmus 10.12973/eu-jer.7.2.245 Pages: 245-249 Student Definitions of Intercultural Competence (IC)- Are They Context-Specific? Nadine Binder, Ozen Odag, Anne Leiser, Lisa Ludders, Karina Karolina Kedzior 10.12973/eu-jer.7.2.251 Pages: 251-265 Investigation of the Visuals Associated with the National identity in Turkish Republic Revolution History and Kemalism Textbooks Mehmet Elban 10.12973/eu-jer.7.2.267 Pages: 267-279 The Interplay of Emotional Instability and Socio-Environmental Aspects of Schools during Adolescence Alexander Lätsch 10.12973/eu-jer.7.2.281 Pages: 281-293 Problems of Gifted and Talented Students Regarding Cursive Handwriting: Parent Opinions Hatice Kadioglu Ates 10.12973/eu-jer.7.2.295 Pages: 295-301 A Study of Curriculum Literacy and Information Literacy Levels of Teacher Candidates in Department of Social Sciences Education Serhat Sural, Nurhak Cem Dedebali 10.12973/eu-jer.7.2.303 Pages: 303-317 Why Should Bilingualized Dictionary of Turkish Be Used in Teaching Turkish as a Foreign Language? Sami Baskin 10.12973/eu-jer.7.2.319 Pages: 319-327 Alternative Observation Tools for the Scope of Contemporary Education Supervision: An Action Research Saadet Kuru Cetin 10.12973/eu-jer.7.2.329 Pages: 329-340 The Reflection of Neoliberal Economic Policies on Education: Privatization of Education in Turkey Arslan Bayram 10.12973/eu-jer.7.2.341 Pages: 341-347 Exploring Prospective Teachers’ Reflections in the Context of Conducting Clinical Interviews Rukiye Didem Taylan 10.12973/eu-jer.7.2.349 Pages: 349-358 Consistency between Constructivist Profiles and Instructional Practices of Prospective Physics Teachers Ozlem Ates, Gul Unal Coban, Serap Kaya Sengoren 10.12973/eu-jer.7.2.359 Pages: 359-372 Fraction Multiplication and Division Word Problems Posed by Different Years of Pre-Service Elementary Mathematics Teachers Tuba Aydogdu Iskenderoglu 10.12973/eu-jer.7.2.373 Pages: 373-385 The Effect of 7E Learning Model on Conceptual Understandings of Prospective Science Teachers on "de Broglie Matter Waves" Subject [O] . 2018

机译：测试的儿童的看法：内容分析戈斯布尔根10.12973 / EU-jer.7.2.159页：研究教师侯赛因·阿卡尔10.12973：159-167工作生活，学校异化，倦怠感，感情承诺和组织公民的质量之间的关系/eu-jer.7.2.169页：169-180测定教师候选人查看关于V图中使用的普通生物学实验室Kadriye Kayacan 10.12973 / EU-jer.7.2.181页数：181-187使用教学技术的由教师在教育过程：隐喻分析研究哈坎Sarac 10.12973 / EU-jer.7.2.189页：福乐智慧的189-202的分析条款价值观教育Aysegul图拉尔10.12973 / EU-jer.7.2.203页数：203-209调查家长的复原力水平与儿童多重残疾基于不同的变量Sinem卡迪Muzeyyen Eldeniz切廷10.12973 / EU-jer.7.2.211页：关于历史教科书在土耳其研究生论文的211-223考试中的一些条款的瓦尔iables ERAY阿拉卡10.12973 / EU-jer.7.2.225页：225-232数学教师乌米特库尔10.12973 / EU-jer.7.2.233页数影响技术的综合专业发展课程：233-243我梦想中的书Hatice Degirmenci Gundogmus 10.12973 / EU-jer.7.2.245页：跨文化能力（IC）的245-249学生的定义 - 他们上下文的具体情况？纳丁活页夹，澳升Odag，安妮·莱泽，丽莎Ludders，卡琳娜卡罗利纳Kedzior 10.12973 / EU-jer.7.2.251页：与土耳其共和国革命历史和基马尔主义教科书穆罕默德Elban 10.12973国家身份的视觉效果相关的251-265调查/欧盟jer.7.2.267页：青春期亚历山大·拉奇10.12973 / EU-jer.7.2.281页面中学校的情绪不稳定和社会环境方面的267-279的互动性：优秀的281-293问题天赋的学生对于行草手写：家长意见Hatice Kadioglu阿泰10.12973 / EU-jer.7.2.295页：教师候选人的课程素养和信息素养水平在社会科学系教育塞尔哈特腓肠，295-301研究Nurhak杰姆Dedebali 10.12973 / EU-耶.7.2.303页数：土耳其的303-317为什么要Bilingualized字典是在教学中使用土耳其语作为一门外语？萨米巴斯10.12973 / EU-jer.7.2.319页：319-327替代观测工具对当代教育督导的范围：一个行动研究Saadet库鲁切廷10.12973 / EU-jer.7.2.329页数：329-340的反思教育新自由主义经济政策：在土耳其阿尔斯兰拜拉姆10.12973 / EU-jer.7.2.341页面教育的私有化：341-347中进行临床访谈Rukiye DIDEM泰兰10.12973 / EU-jer.7.2背景下探索未来教师的思考。 349页：建构概况和前景物理教师Ozlem阿泰的教学实践，古尔ÜNAL科班之间349-358一致性SERAP卡亚Sengoren 10.12973 / EU-jer.7.2.359页数：359-372馏分乘法和除法文字题所构成的不同几年前服务初等数学教师图拔Aydogdu Iskenderoglu 10.12973 / EU-jer.7.2.373页数：373-385前瞻科学教师的概念理解7E学习模式的影响“德布罗意物质波”主题

Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer

摘要

著录项

相似文献

相关主题

期刊订阅