FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence

Zhijie Xie; Shenghui Song


Abstract

One of the fundamental issues in Federated Learning (FL) is data heterogeneity, which causes accuracy degradation, slow convergence, and communication bottlenecks. Although the impact of data heterogeneity on supervised FL has been widely studied, the corresponding investigation for Federated Reinforcement Learning (FRL) is still in its infancy. In this paper, we first define the type and level of data heterogeneity for FRL systems. By inspecting the connection between the global and local objective functions, we prove that local training can benefit the global objective if the local update is properly penalized by the total variation (TV) distance between the local and global policies. We also derive a necessary condition for the global policy to be learnable from the local environments, which is directly related to the heterogeneity level. Based on these theoretical results, we propose a Kullback-Leibler (KL) divergence based penalty that directly constrains the model outputs in the distribution space, and we provide a convergence proof for the proposed algorithm. By jointly penalizing the divergence of the local policy from the global policy with a global penalty and penalizing each iteration of local training with a local penalty, the proposed method achieves a better trade-off between training speed (step size) and convergence. Experimental results on two popular Reinforcement Learning (RL) experiment platforms demonstrate the advantage of the proposed algorithm over existing methods in accelerating and stabilizing the training process with heterogeneous data.
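The two-penalty idea described in the abstract can be illustrated with a minimal sketch for discrete action spaces. This is not the authors' implementation: the function names, the importance-weighted surrogate, the direction of each KL term, and the coefficients `global_coef` and `local_coef` are all assumptions made here for illustration only.

```python
import numpy as np


def kl_divergence(p, q, eps=1e-12):
    """Mean KL(p || q) over a batch of categorical distributions (one per row)."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.mean(np.sum(p * (np.log(p) - np.log(q)), axis=1)))


def penalized_objective(new_probs, prev_probs, global_probs,
                        actions, advantages, global_coef, local_coef):
    """Hypothetical local objective: surrogate minus two KL penalties.

    new_probs    -- action probabilities of the candidate local policy
    prev_probs   -- action probabilities of the previous local iterate
    global_probs -- action probabilities of the global policy
    All three have shape (batch, n_actions); actions and advantages
    have shape (batch,).
    """
    idx = np.arange(len(actions))
    # Importance-weighted advantage estimate (assumed surrogate form).
    ratio = new_probs[idx, actions] / prev_probs[idx, actions]
    surrogate = float(np.mean(ratio * advantages))
    # "Global" penalty: drift of the local policy away from the global policy.
    global_penalty = global_coef * kl_divergence(global_probs, new_probs)
    # "Local" penalty: size of the step taken in this local iteration.
    local_penalty = local_coef * kl_divergence(prev_probs, new_probs)
    return surrogate - global_penalty - local_penalty
```

In this sketch, a candidate policy identical to both the previous iterate and the global policy incurs no penalty, so the objective reduces to the mean advantage. Any deviation lowers the objective in proportion to the two coefficients, which is one way to picture the trade-off between step size and convergence mentioned in the abstract.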

Bibliographic details

Source
IEEE Journal on Selected Areas in Communications | 2023, Issue 4 | pp. 1227-1242 | 16 pages
Authors
Zhijie Xie; Shenghui Song
Affiliation

Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Sai Kung, Hong Kong

Indexing information
Original format: PDF
Language: English
CLC classification: Radio electronics; telecommunications technology
Keywords
Training; Convergence; Data models; Servers; Heuristic algorithms; Optimization; Linear programming
