Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing

Asheralieva Alia; Niyato Dusit

首页> 外文期刊>IEEE Transactions on Cognitive Communications and Networking >Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing

【24h】

Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing

机译：贝叶斯加固学习和贝叶斯深入学习，具有移动边缘计算的区块

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a novel game-theoretic, Bayesian reinforce-ment learning (RL) and deep learning (DL) framework to represent interactions of miners in public and consortium blockchains with mobile edge computing (MEC). Within the framework, we formulate a stochastic game played by miners under incomplete information. Each miner can offload its block operations to one of the base stations (BSs) equipped with the MEC server. The miners select their offloading BSs and block processing rates simultaneously and independently, without informing other miners about their actions. As such, no miner knows the past and current actions of others and, hence, constructs its belief about these actions. Accordingly, we devise a Bayesian RL algorithm based on the partially-observable Markov decision process for miner's decision making that allows each miner to dynamically adjust its strategy and update its beliefs through repeated interactions with each other and with the mobile environment. We also propose a novel unsupervised Bayesian deep learning algorithm where the uncertainties about unobservable states are approximated with Bayesian neural networks. We show that the proposed Bayesian RL and DL algorithms converge to the stable states where the miners' actions and beliefs form the perfect Bayesian equilibrium (PBE) and myopic PBE, respectively.

机译：我们提出了一部小说游戏理论，贝叶斯强化学习（RL）和深度学习（DL）框架，以代表矿工在公共和联盟中与移动边缘计算（MEC）的互动。在框架内，我们制定了由不完整信息下的矿工扮演的随机游戏。每个矿工可以将其块操作卸载到配备MEC服务器的基站（BSS）之一。矿工同时和独立地选择他们的卸载BSS并阻止处理率，而无需向其他矿工通知他们的行为。因此，没有矿工知道其他人的过去和目前的行为，因此构建了对这些行为的信念。因此，我们基于矿工决策的部分可观察的马尔可夫决策过程设计了一种贝叶斯RL算法，允许每个矿工通过彼此重复的交互和移动环境来动态调整其策略并更新其信仰。我们还提出了一种新颖的无人监督的贝叶斯深度学习算法，其中关于不可观察状态的不确定性近似于贝叶斯神经网络。我们展示了拟议的贝叶斯RL和DL算法分别融合到矿业行动和信仰的稳定状态，分别形成完美的贝叶斯均衡（PBE）和近视PBE。

著录项

来源
《IEEE Transactions on Cognitive Communications and Networking》 |2021年第1期|319-335|共17页
作者
Asheralieva Alia; Niyato Dusit;
展开▼
作者单位

Southern Univ Sci & Technol Dept Comp Sci & Engn Shenzhen 518055 Peoples R China;

Nanyang Technol Univ Sch Comp Sci & Engn Singapore 639798 Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Task analysis; Bayes methods; Games; Resource management; Machine learning; Protocols; Bayesian methods; blockchains; deep learning; game theory; incomplete information; machine learning; mobile edge computing; partially-observable Markov decision process; reinforcement learning; resource management;

机译：任务分析;贝叶斯方法;游戏;资源管理;机器学习;贝叶斯方法;区间;深入学习;博弈论;不完整的信息;机器学习;移动边缘计算;钢筋学习;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;资源管理;

相似文献

外文文献
中文文献
专利

1. Deep Reinforcement Learning (DRL)-Based Device-to-Device (D2D) Caching With Blockchain and Mobile Edge Computing [J] . Zhang Ran, Yu F. Richard, Liu Jiang, IEEE transactions on wireless communications . 2020,第10期

机译：基于BlockChain和移动边缘计算的基于设备到设备（D2D）缓存的深增强学习（DRL）
2. Online Deep Reinforcement Learning for Computation Offloading in Blockchain-Empowered Mobile Edge Computing [J] . Xiaoyu Qiu, Luobin Liu, Wuhui Chen, IEEE Transactions on Vehicular Technology . 2019,第8期

机译：在线深度强化学习，用于区块链增强型移动边缘计算中的计算分流
3. Deep reinforcement learning assisted edge-terminal collaborative offloading algorithm of blockchain computing tasks for energy Internet [J] . Xu Siya, Liao Boxian, Yang Chao, International journal of electrical power and energy systems . 2021,第Octa期

机译：BlockChain计算任务的深度加强学习辅助边缘终端协同卸载算法
4. Deep Reinforcement Learning for Computation Offloading and Resource Allocation in Blockchain-Based Multi-UAV-Enabled Mobile Edge Computing [C] . Abegaz Mohammed, Hayla Nahom, Ayall Tewodros, International Computer Conference on Wavelet Active Media Technology and Information Processing . 2020

机译：基于区块基的多UAV的移动边缘计算中的计算卸载和资源分配的深度增强学习
5. Application Placement in Edge Computing – Optimization, Game, and Deep Reinforcement Learning [D] . Cao, Zhi. 2021

机译：边缘计算中的应用放置 - 优化，游戏和深度加固学习
6. Wearable IoT Smart-Log Patch: An Edge Computing-Based Bayesian Deep Learning Network System for Multi Access Physical Monitoring System [O] . Gunasekaran Manogaran, P. Mohamed Shakeel, H. Fouad, 2019

机译：可穿戴式物联网智能日志补丁：基于边缘计算的贝叶斯深度学习网络系统用于多路访问物理监控系统
7. Deep Reinforcement Learning and Permissioned Blockchain for Content Caching in Vehicular Edge Computing and Networks [O] . Yueyue Dai, Du Xu, Ke Zhang, 2020

机译：车辆边缘计算和网络中内容缓存的深度加强学习和允许区块链

Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅