Wireless Communications Letters, IEEE

Dynamic Pricing for Smart Mobile Edge Computing: A Reinforcement Learning Approach

Abstract

This letter studies the revenue maximization problem for a mobile edge computing (MEC) system, in which an access point (AP) equipped with an MEC server provides job offloading service for multiple resource-hungry users and charges them a service fee for it. Usually, the information about users' personal demand is unknown and users' job arrival rates are time-varying, which makes pricing highly challenging. As such, we develop a policy gradient (PG)-based reinforcement learning (RL) algorithm. Specifically, a deep neural network (DNN) is adopted as the policy network to design the pricing policy, and a baseline neural network (BNN) is used to reduce the inherently high variance of the gradient obtained via PG. The proposed PG-based algorithm enables continuous pricing, an advancement over the conventional Q-learning algorithm, which supports only a discrete action space. Simulation results show that the proposed method converges to the optimal revenue performance, while the Q-learning algorithm suffers a 44% revenue loss.
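
The core mechanism in the abstract, a policy-gradient update with a learned baseline over a continuous price, can be sketched compactly. The PyTorch sketch below is illustrative only: the network sizes, the state layout, a Gaussian policy over a scalar price, and the `update` helper are assumptions for clarity, not the paper's exact architecture.

```python
# Minimal sketch: REINFORCE with a learned baseline for continuous pricing.
# All hyperparameters and the Gaussian-policy choice are assumptions.
import torch
import torch.nn as nn

class PricePolicy(nn.Module):
    """DNN policy: maps the system state to a Gaussian over the (continuous) price."""
    def __init__(self, state_dim, hidden=64):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(state_dim, hidden), nn.Tanh(),
                                  nn.Linear(hidden, hidden), nn.Tanh())
        self.mean = nn.Linear(hidden, 1)              # mean of the price distribution
        self.log_std = nn.Parameter(torch.zeros(1))   # learned exploration noise

    def forward(self, state):
        h = self.body(state)
        return torch.distributions.Normal(self.mean(h), self.log_std.exp())

class Baseline(nn.Module):
    """Baseline network (BNN): predicts the expected return to reduce gradient variance."""
    def __init__(self, state_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(state_dim, hidden), nn.Tanh(),
                                 nn.Linear(hidden, 1))

    def forward(self, state):
        return self.net(state)

def update(policy, baseline, opt_p, opt_b, states, prices, returns):
    """One PG step on a batch: advantage = observed return - baseline(state)."""
    dist = policy(states)
    adv = returns - baseline(states).squeeze(-1)
    # REINFORCE with baseline: the advantage is treated as a constant weight.
    policy_loss = -(dist.log_prob(prices).squeeze(-1) * adv.detach()).mean()
    opt_p.zero_grad(); policy_loss.backward(); opt_p.step()
    # The baseline is regressed toward the observed returns.
    baseline_loss = adv.pow(2).mean()
    opt_b.zero_grad(); baseline_loss.backward(); opt_b.step()
```

Sampling a price at decision time is then `policy(state).sample()`. The Gaussian head is what makes the action space continuous; a Q-learning agent would instead have to pick from a fixed price grid, which is the discretization the abstract's 44% revenue-loss comparison refers to.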
