首页> 外文期刊>IEEE Transactions on Information Theory >Data Encoding for Byzantine-Resilient Distributed Optimization
【24h】

Data Encoding for Byzantine-Resilient Distributed Optimization

机译:拜占庭式弹性分布式优化的数据编码

获取原文
获取原文并翻译 | 示例
           

摘要

We study distributed optimization in the presence of Byzantine adversaries, where both data and computation are distributed among $m$ worker machines, $t$ of which may be corrupt. The compromised nodes may collaboratively and arbitrarily deviate from their pre-specified programs, and a designated (master) node iteratively computes the model/parameter vector for generalized linear models . In this work, we primarily focus on two iterative algorithms: Proximal Gradient Descent (PGD) and Coordinate Descent (CD). Gradient descent (GD) is a special case of these algorithms. PGD is typically used in the data-parallel setting, where data is partitioned across different samples, whereas, CD is used in the model-parallelism setting, where data is partitioned across the parameter space. At the core of our solutions to both these algorithms is a method for Byzantine-resilient matrix-vector (MV) multiplication; and for that, we propose a method based on data encoding and error correction over real numbers to combat adversarial attacks. We can tolerate up to $tleq lfloor rac {m-1}{2}floor $ corrupt worker nodes, which is information-theoretically optimal. We give deterministic guarantees, and our method does not assume any probability distribution on the data. We develop a sparse encoding scheme which enables computationally efficient data encoding and decoding. We demonstrate a trade-off between the corruption threshold and the resource requirements (storage, computational, and communication complexity). As an example, for $tleq rac {m}{3}$ , our scheme incurs only a constant overhead on these resources, over that required by the plain distributed PGD/CD algorithms which provide no adversarial protection. To the best of our knowledge, ours is the first paper that connects MV multiplication with CD and designs a specific encoding matrix for MV multiplication whose structure we can leverage to make CD secure against adversarial attacks. Our encoding scheme extends efficiently to (i) the data streaming model, in which data samples come in an online fashion and are encoded as they arrive, and (ii) making stochastic gradient descent (SGD) Byzantine-resilient. In the end, we give experimental results to show the efficacy of our proposed schemes.
机译:我们研究了拜占庭对手存在的分布式优化,其中数据和计算都分布在<内联公式XMLNS:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http ://www.w3.org/1999/xlink“> $ m $ 工人计算机,<内联公式xmlns:mml = “http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”> $ t $ 其中可能已损坏。受损的节点可以协作地和任意地偏离其预先指定的程序,并且指定的(主机)节点迭代地计算<斜体XMLNS:MML =“http://www.w3.org/1998/math的模型/参数向量/ mathml“xmlns:xlink =”http://www.w3.org/1999/xlink“>概括的线性模型。在这项工作中,我们主要关注两个迭代算法:<斜体xmlns:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/ 1999 / xlink“>近端渐变下降(pgd)和<斜体xmlns:mml =”http://www.w3.org/1998/math/mathml“xmlns:xlink =”http:// www。 w3.org/1999/xlink"etcoords vescent (CD)。梯度下降(GD)是这些算法的特殊情况。 PGD​​通常用于数据并行设置,其中数据在不同的样本上划分,而CD用于模型并行设置,其中数据在参数空间上划分。在我们对这两个算法的解决方案的核心,是拜占庭式弹性矩阵矢量(MV)乘法的方法;为此,我们提出了一种基于数据编码和纠错的方法,以对抗对抗攻击的实际数字。我们可以宽容到<内联公式XMLNS:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”> $ t Leq Lfloor FRAC {M-1} {2} rfloor $ 损坏的工作者节点,这是理论上的信息最佳。我们提供确定性保证,我们的方法不假设数据上的任何概率分布。我们开发一个<斜体XMLNS:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”> sparse 编码方案,其能够实现有效的数据编码和解码。我们在腐败阈值和资源需求(存储,计算和通信复杂性)之间展示了权衡。例如,对于<内联公式XMLNS:MML =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”> $ t leq frac {m} {3} $ ,我们的scheme只引起常量在这些资源上的开销,在普通要求的情况下分布式PGD / CD算法,不提供对抗性保护。据我们所知,我们的是第一种用CD连接MV乘法的论文,并为MV乘法设计了一个特定的编码矩阵,其结构我们可以利用,使CD为对抗对抗攻击。我们的编码方案扩展了<斜体xmlns:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”>有效地到<斜体xmlns:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”>(i)< /斜体>数据流模型,其中数据样本以在线方式进行,并在到达时编码,以及<斜体XMLNS:MML =“http://www.w3.org/1998/math/mathml”xmlns: xlink =“http://www.w3.org/1999/xlink”>(ii)制作随机渐变血压(sgd)拜占庭式弹性。最后,我们给出了实验结果,以表明我们提出的计划的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号