Wireless Personal Communications: An International Journal

Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems



Abstract

Personalized online learning has been widely adopted in recent years and has become a promising instructional strategy. A key way to provide it is personalized recommendation: guiding students to suitable learning content at the right time. This is a nontrivial problem, however, because online learning environments are highly flexible: students learn independently, according to their own characteristics and situations. Existing recommendation methods do not work effectively in such environments. The objective of this study is therefore to provide personalized, dynamic, and continuous recommendations for online learning systems. We propose a method based on contextual bandits, a reinforcement-learning formulation that works effectively in dynamic environments. We use past student behaviors and the current student state as the contextual information from which the agent's policy selects the optimal action. We evaluate the proposed method on real data from an online learning system, comparing it with well-known reinforcement-learning baselines: epsilon-greedy, greedy with optimistic initial values, and the upper confidence bound (UCB) method. The results show that the proposed method significantly outperforms these baselines in our test case.
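The abstract does not detail the authors' algorithm, but the contextual-bandit setting it describes, choosing a learning item given a context vector built from student state and past behavior, can be illustrated with a minimal sketch. The version below uses LinUCB, a standard contextual-bandit algorithm; the class and parameter names are illustrative, not taken from the paper.

```python
import numpy as np

class LinUCB:
    """Contextual bandit via LinUCB: one linear reward model per arm
    (learning item), choosing the arm with the highest upper confidence
    bound on expected reward given the student's context vector."""

    def __init__(self, n_arms, dim, alpha=1.0):
        self.alpha = alpha                               # exploration strength
        self.A = [np.eye(dim) for _ in range(n_arms)]    # per-arm design matrix
        self.b = [np.zeros(dim) for _ in range(n_arms)]  # per-arm reward vector

    def select(self, x):
        """Pick an arm for context x (e.g. student state + behavior features)."""
        scores = []
        for A, b in zip(self.A, self.b):
            A_inv = np.linalg.inv(A)
            theta = A_inv @ b                            # ridge-regression estimate
            # mean reward estimate plus an exploration bonus (UCB term)
            scores.append(theta @ x + self.alpha * np.sqrt(x @ A_inv @ x))
        return int(np.argmax(scores))

    def update(self, arm, x, reward):
        """Fold the observed (context, reward) pair into the chosen arm's model."""
        self.A[arm] += np.outer(x, x)
        self.b[arm] += reward * x
```

In a recommendation loop, `select` would be called with the current student's feature vector and `update` with an observed engagement or performance signal, so the policy adapts continuously as student behavior changes.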
