首页> 外文会议>IEEE Global Communications Conference >ClusterGrad: Adaptive Gradient Compression by Clustering in Federated Learning
【24h】

ClusterGrad: Adaptive Gradient Compression by Clustering in Federated Learning

机译:ClusterGrad:联合学习中的聚类自适应渐变压缩

获取原文

摘要

Recently, Federated Learning (FL) has drawn tremendous attentions due to its ability to protect client's privacy. In FL, clients collaboratively train machine learning models by merely sharing intermediate computations, i.e., gradients of model parameters. However, training a complicated model involves multiple rounds of interactions between clients and the server via the Internet. Consequently, communication is a primary bottleneck of FL attributed to the poor network conditions and the large amount of interchanged computations. To overcome the communication bottleneck, we propose the ClusterGrad algorithm to compress gradients which can considerably reduce the volume of communicated computations. Our design is based on the fact that there is only a small fraction of gradients whose values are far away from the origin in each round of interaction in FL. We first identify these essential gradients that are far away from 0 using the K-means algorithm. These gradient values are approximated by a novel clustering based quantization algorithm. Then, the rest gradients lying close to 0 are approximated with a single value. We can prove that ClusterGrad outperforms the latest FL gradient compression algorithms: Probability Quantization (PQ) and Deep Gradient Compression (DGC). We conduct extensive experiments with the CIFAR-10 datasets which further demonstrate that ClusterGrad can achieve compression ratio (used interchangeably with compression rate) 123 on average in comparison with PQ and DGC with compression ratios 16 and 60 respectively.
机译:最近,联合学习(FL)由于其保护客户的隐私而引起了巨大的关注。在FL中,客户通过仅共享中间计算,即模型参数的梯度来协作机器学习模型。但是,培训复杂模型涉及客户端和服务器之间的多轮交互通过互联网。因此,通信是归因于网络条件差和大量的互换计算的FL的主要瓶颈。为了克服通信瓶颈,我们提出了ClusterGrad算法来压缩梯度,该梯度可以显着降低传达计算量。我们的设计基于以下事实,即只有一小部分梯度,其值远离来自FL中的每一轮相互作用的原点。我们首先使用K-Means算法识别远离0的这些基本梯度。这些梯度值由基于新的基于聚类的量化算法近似。然后,靠近0的静止梯度近似以单个值近似。我们可以证明ClusterGrad优于最新的FL梯度压缩算法:概率量化(PQ)和深梯度压缩(DGC)。我们对CiFar-10数据集进行了广泛的实验,进一步证明簇本分别与PQ和DGC分别具有压缩比16和60的PQ和DGC,可以平均实现压缩比(以压缩率互换)123。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号