IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)

Byzantine Fault-Tolerant Distributed Machine Learning with Norm-Based Comparative Gradient Elimination

Abstract

This paper considers the Byzantine fault-tolerance problem in the distributed stochastic gradient descent (D-SGD) method, a popular algorithm for distributed multi-agent machine learning. In this problem, each agent samples data points independently from a certain data-generating distribution. In the fault-free case, the D-SGD method allows all the agents to learn a mathematical model that best fits the data collectively sampled by all the agents. We consider the case when a fraction of the agents may be Byzantine faulty. Such faulty agents may not follow a prescribed algorithm correctly, and may render the traditional D-SGD method ineffective by sharing arbitrary incorrect stochastic gradients. We propose a norm-based gradient-filter, named comparative gradient elimination (CGE), that robustifies the D-SGD method against Byzantine agents. We show that the CGE gradient-filter guarantees fault-tolerance against a bounded fraction of Byzantine agents under standard stochastic assumptions, and is computationally simpler than many existing gradient-filters, such as multi-KRUM, geometric median-of-means, and spectral filters. By simulating distributed learning on neural networks, we empirically show that the fault-tolerance of CGE is comparable to that of existing gradient-filters. We also empirically show that exponential averaging of stochastic gradients improves the fault-tolerance of a generic gradient-filter.
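The abstract describes the CGE gradient-filter only at a high level. Below is a minimal sketch of a norm-based filter in that spirit, assuming the server receives one stochastic gradient per agent, discards the f gradients with the largest Euclidean norms, and averages the rest; the function name cge_filter and the NumPy setting are illustrative, not taken from the paper.

    import numpy as np

    def cge_filter(gradients, f):
        # gradients: (n, d) array, one stochastic gradient per agent
        # f: assumed upper bound on the number of Byzantine agents
        n = gradients.shape[0]
        norms = np.linalg.norm(gradients, axis=1)  # Euclidean norm per agent
        keep = np.argsort(norms)[: n - f]          # n - f smallest-norm gradients
        return gradients[keep].mean(axis=0)        # average the retained gradients

Under this sketch, the server would then take the usual D-SGD step, w := w - eta * cge_filter(G, f), with the filtered aggregate in place of the simple average of all reported gradients. The filter costs only a norm computation and a sort, which is the sense in which it is simpler than multi-KRUM or spectral methods.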
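The final claim, that exponential averaging improves a generic gradient-filter, can be read as maintaining a per-agent exponentially weighted moving average of reported gradients and feeding those averages, rather than the raw gradients, into the filter. A sketch under that assumption follows; beta is an illustrative smoothing parameter, not a value from the paper.

    import numpy as np

    def exp_average(avg_grads, new_grads, beta=0.9):
        # avg_grads, new_grads: (n, d) arrays of running per-agent
        # averages and freshly reported gradients, respectively
        return beta * avg_grads + (1.0 - beta) * new_grads

    # Per iteration (sketch): smooth first, then filter.
    # avg_grads = exp_average(avg_grads, new_grads)
    # update = cge_filter(avg_grads, f)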
