Data Encoding for Byzantine-Resilient Distributed Optimization

Deepesh Data; Linqi Song; Suhas N. Diggavi

首页> 外文期刊>IEEE Transactions on Information Theory >Data Encoding for Byzantine-Resilient Distributed Optimization

【24h】

Data Encoding for Byzantine-Resilient Distributed Optimization

机译：拜占庭式弹性分布式优化的数据编码

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study distributed optimization in the presence of Byzantine adversaries, where both data and computation are distributed among

$m$

worker machines,

$t$

of which may be corrupt. The compromised nodes may collaboratively and arbitrarily deviate from their pre-specified programs, and a designated (master) node iteratively computes the model/parameter vector for generalized linear models . In this work, we primarily focus on two iterative algorithms: Proximal Gradient Descent (PGD) and Coordinate Descent (CD). Gradient descent (GD) is a special case of these algorithms. PGD is typically used in the data-parallel setting, where data is partitioned across different samples, whereas, CD is used in the model-parallelism setting, where data is partitioned across the parameter space. At the core of our solutions to both these algorithms is a method for Byzantine-resilient matrix-vector (MV) multiplication; and for that, we propose a method based on data encoding and error correction over real numbers to combat adversarial attacks. We can tolerate up to

$tleq lfloor rac {m-1}{2}floor $

corrupt worker nodes, which is information-theoretically optimal. We give deterministic guarantees, and our method does not assume any probability distribution on the data. We develop a sparse encoding scheme which enables computationally efficient data encoding and decoding. We demonstrate a trade-off between the corruption threshold and the resource requirements (storage, computational, and communication complexity). As an example, for

$tleq rac {m}{3}$

, our scheme incurs only a constant overhead on these resources, over that required by the plain distributed PGD/CD algorithms which provide no adversarial protection. To the best of our knowledge, ours is the first paper that connects MV multiplication with CD and designs a specific encoding matrix for MV multiplication whose structure we can leverage to make CD secure against adversarial attacks. Our encoding scheme extends efficiently to (i) the data streaming model, in which data samples come in an online fashion and are encoded as they arrive, and (ii) making stochastic gradient descent (SGD) Byzantine-resilient. In the end, we give experimental results to show the efficacy of our proposed schemes.

机译：我们研究了拜占庭对手存在的分布式优化，其中数据和计算都分布在<内联公式XMLNS：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http ：//www.w3.org/1999/xlink“> $ m $ 工人计算机，<内联公式xmlns：mml = “http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”> $ t $ 其中可能已损坏。受损的节点可以协作地和任意地偏离其预先指定的程序，并且指定的（主机）节点迭代地计算<斜体XMLNS：MML =“http://www.w3.org/1998/math的模型/参数向量/ mathml“xmlns：xlink =”http://www.w3.org/1999/xlink“>概括的线性模型。在这项工作中，我们主要关注两个迭代算法：<斜体xmlns：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/ 1999 / xlink“>近端渐变下降（pgd）和<斜体xmlns：mml =”http://www.w3.org/1998/math/mathml“xmlns：xlink =”http：// www。 w3.org/1999/xlink"etcoords vescent （CD）。梯度下降（GD）是这些算法的特殊情况。 PGD通常用于数据并行设置，其中数据在不同的样本上划分，而CD用于模型并行设置，其中数据在参数空间上划分。在我们对这两个算法的解决方案的核心，是拜占庭式弹性矩阵矢量（MV）乘法的方法;为此，我们提出了一种基于数据编码和纠错的方法，以对抗对抗攻击的实际数字。我们可以宽容到<内联公式XMLNS：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”> $ t Leq Lfloor FRAC {M-1} {2} rfloor $ 损坏的工作者节点，这是理论上的信息最佳。我们提供确定性保证，我们的方法不假设数据上的任何概率分布。我们开发一个<斜体XMLNS：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”> sparse 编码方案，其能够实现有效的数据编码和解码。我们在腐败阈值和资源需求（存储，计算和通信复杂性）之间展示了权衡。例如，对于<内联公式XMLNS：MML =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”> $ t leq frac {m} {3} $ ，我们的scheme只引起常量在这些资源上的开销，在普通要求的情况下分布式PGD / CD算法，不提供对抗性保护。据我们所知，我们的是第一种用CD连接MV乘法的论文，并为MV乘法设计了一个特定的编码矩阵，其结构我们可以利用，使CD为对抗对抗攻击。我们的编码方案扩展了<斜体xmlns：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”>有效地到<斜体xmlns：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”>（i）< /斜体>数据流模型，其中数据样本以在线方式进行，并在到达时编码，以及<斜体XMLNS：MML =“http://www.w3.org/1998/math/mathml”xmlns： xlink =“http://www.w3.org/1999/xlink”>（ii）制作随机渐变血压（sgd）拜占庭式弹性。最后，我们给出了实验结果，以表明我们提出的计划的功效。

著录项

来源
《IEEE Transactions on Information Theory》 |2021年第2期|1117-1140|共24页
作者
Deepesh Data; Linqi Song; Suhas N. Diggavi;
展开▼
作者单位

University of California at Los Angeles (UCLA) Los Angeles CA USA;

University of California at Los Angeles (UCLA) Los Angeles CA USA;

City University of Hong Kong Hong Kong;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Encoding; Distributed databases; Computational modeling; Data models; Optimization; Computer architecture; Partitioning algorithms;

机译：编码;分布式数据库;计算建模;数据模型;优化;计算机架构;分区算法;

相似文献

外文文献
中文文献
专利

1. Finite-Time Guarantees for Byzantine-Resilient Distributed State Estimation With Noisy Measurements [J] . Su Lili, Shahrampour Shahin IEEE Transactions on Automatic Control . 2020,第9期

机译：具有嘈杂测量的拜占庭式弹性分布式状态估计的有限时间保证
2. Byzantine-resilient distributed observers for LTI systems [J] . Mitra Aritra, Sundaram Shreyas Automatica . 2019,第期

机译：用于LTI系统的拜占庭式弹性分布式观察者
3. DRACO: Byzantine-resilient Distributed Training via Redundant Gradients [J] . Lingjiao Chen, Hongyi Wang, Zachary Charles, JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：DRACO：通过冗余渐变拜占庭式的分布式训练
4. Data Encoding Methods for Byzantine-Resilient Distributed Optimization [C] . Deepesh Data, Linqi Song, Suhas Diggavi IEEE International Symposium on Information Theory . 2019

机译：拜占庭弹性分布优化的数据编码方法
5. DISTRIBUTED INFORMATION AND DISTRIBUTED CONTROL: CASES FROM STOCHASTIC SYSTEMS AND DATABASE MANAGEMENT (QUERY OPTIMIZATION, DISCRETE-EVENT) [D] . LAFORTUNE, STEPHANE. 1986

机译：分布式信息和分布式控制：来自随机系统和数据库管理的案例（查询优化，离散事件）
6. An optimized approach for simultaneous horizontal data fragmentation and allocation in Distributed Database Systems (DDBSs) [O] . Ali A. Amer, Adel A. Sewisy, Taha M.A. Elgendy 2017

机译：分布式数据库系统（DDBS）中同时进行水平数据分段和分配的优化方法
7. Byzantine-Resilient Distributed Optimization of Multi-Dimensional Functions [O] . Kananart Kuwaranancharoen, Lei Xin, Shreyas Sundaram 2020

机译：拜占庭式 - 弹性分布式优化多维功能

Data Encoding for Byzantine-Resilient Distributed Optimization

摘要

著录项

相似文献

相关主题

期刊订阅