首页> 外文期刊>IEEE Transactions on Cognitive Communications and Networking >Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing
【24h】

Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing

机译:贝叶斯加固学习和贝叶斯深入学习,具有移动边缘计算的区块

获取原文
获取原文并翻译 | 示例

摘要

We present a novel game-theoretic, Bayesian reinforce-ment learning (RL) and deep learning (DL) framework to represent interactions of miners in public and consortium blockchains with mobile edge computing (MEC). Within the framework, we formulate a stochastic game played by miners under incomplete information. Each miner can offload its block operations to one of the base stations (BSs) equipped with the MEC server. The miners select their offloading BSs and block processing rates simultaneously and independently, without informing other miners about their actions. As such, no miner knows the past and current actions of others and, hence, constructs its belief about these actions. Accordingly, we devise a Bayesian RL algorithm based on the partially-observable Markov decision process for miner's decision making that allows each miner to dynamically adjust its strategy and update its beliefs through repeated interactions with each other and with the mobile environment. We also propose a novel unsupervised Bayesian deep learning algorithm where the uncertainties about unobservable states are approximated with Bayesian neural networks. We show that the proposed Bayesian RL and DL algorithms converge to the stable states where the miners' actions and beliefs form the perfect Bayesian equilibrium (PBE) and myopic PBE, respectively.
机译:我们提出了一部小说游戏理论,贝叶斯强化学习(RL)和深度学习(DL)框架,以代表矿工在公共和联盟中与移动边缘计算(MEC)的互动。在框架内,我们制定了由不完整信息下的矿工扮演的随机游戏。每个矿工可以将其块操作卸载到配备MEC服务器的基站(BSS)之一。矿工同时和独立地选择他们的卸载BSS并阻止处理率,而无需向其他矿工通知他们的行为。因此,没有矿工知道其他人的过去和目前的行为,因此构建了对这些行为的信念。因此,我们基于矿工决策的部分可观察的马尔可夫决策过程设计了一种贝叶斯RL算法,允许每个矿工通过彼此重复的交互和移动环境来动态调整其策略并更新其信仰。我们还提出了一种新颖的无人监督的贝叶斯深度学习算法,其中关于不可观察状态的不确定性近似于贝叶斯神经网络。我们展示了拟议的贝叶斯RL和DL算法分别融合到矿业行动和信仰的稳定状态,分别形成完美的贝叶斯均衡(PBE)和近视PBE。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号