A Robust Learning Approach for Regression Models Based on Distributionally Robust Optimization

Ruidi Chen; Ioannis Ch. Paschalidis

Abstract

We present a Distributionally Robust Optimization (DRO) approach to estimate a robustified regression plane in a linear regression setting, when the observed samples are potentially contaminated with adversarially corrupted outliers. Our approach mitigates the impact of outliers by hedging against a family of probability distributions on the observed data, some of which assign very low probabilities to the outliers. The distributions under consideration are close to the empirical distribution in the sense of the Wasserstein metric. We show that this DRO formulation can be relaxed to a convex optimization problem which encompasses a class of models. By selecting proper norm spaces for the Wasserstein metric, we are able to recover several commonly used regularized regression models. We provide new insights into the regularization term and give guidance on the selection of the regularization coefficient from the standpoint of a confidence region. We establish two types of performance guarantees for the solution to our formulation under mild conditions. One is related to its out-of-sample behavior (prediction bias), and the other concerns the discrepancy between the estimated and true regression planes (estimation bias). Extensive numerical results demonstrate the superiority of our approach over a host of regression models, in terms of prediction and estimation accuracy. We also consider the application of our robust learning procedure to outlier detection, and show that our approach achieves a much higher AUC (Area Under the ROC Curve) than M-estimation (Huber, 1964, 1973).
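
The abstract does not spell out the relaxed convex problem, so the following is only a minimal sketch under stated assumptions: that the relaxation takes the form of a mean absolute residual loss plus a penalty eps * ||(-beta, 1)||, and that the norm is chosen to be the l2 norm. The function names (`dro_objective`, `fit_dro_regression`, `make_contaminated_data`), the synthetic data, the value of `eps`, and the plain subgradient solver are all illustrative choices made here, not the authors' implementation.

```python
import numpy as np


def dro_objective(beta, X, y, eps):
    """Mean absolute residual plus eps times the l2 norm of (-beta, 1).

    Assumed form of the relaxed problem; the l2 norm is one possible choice.
    """
    residuals = y - X @ beta
    return np.mean(np.abs(residuals)) + eps * np.sqrt(beta @ beta + 1.0)


def fit_dro_regression(X, y, eps=0.1, n_iter=5000, step=0.5):
    """Subgradient descent on dro_objective; returns the best iterate seen."""
    n, d = X.shape
    beta = np.zeros(d)
    best_beta, best_val = beta.copy(), dro_objective(beta, X, y, eps)
    for t in range(1, n_iter + 1):
        residuals = y - X @ beta
        # Subgradient of the mean absolute loss (sign(0) = 0 is a valid choice).
        g_loss = -(X.T @ np.sign(residuals)) / n
        # Gradient of eps * ||(-beta, 1)||_2, smooth because of the constant 1.
        g_reg = eps * beta / np.sqrt(beta @ beta + 1.0)
        # Diminishing step size, standard for subgradient methods.
        beta = beta - (step / np.sqrt(t)) * (g_loss + g_reg)
        val = dro_objective(beta, X, y, eps)
        if val < best_val:
            best_beta, best_val = beta.copy(), val
    return best_beta


def make_contaminated_data(n=200, outlier_frac=0.1, seed=0):
    """Hypothetical synthetic data: a linear model with a block of shifted responses."""
    rng = np.random.default_rng(seed)
    beta_true = np.array([1.0, -2.0, 0.5])
    X = rng.normal(size=(n, 3))
    y = X @ beta_true + 0.1 * rng.normal(size=n)
    n_out = int(outlier_frac * n)
    y[:n_out] += 20.0  # corrupt the first n_out responses
    return X, y, beta_true


if __name__ == "__main__":
    X, y, beta_true = make_contaminated_data()
    beta_hat = fit_dro_regression(X, y, eps=0.1)
    print("true coefficients:     ", beta_true)
    print("estimated coefficients:", np.round(beta_hat, 3))
```

In this sketch `eps` plays the role of the regularization coefficient, which the abstract ties to the size of the Wasserstein confidence region. Swapping the l2 norm for another norm changes the penalty and would correspond to a different regularized regression model.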

Bibliographic details

Source: Journal of Machine Learning Research, 2018, issue a, 48 pages
Authors: Ruidi Chen; Ioannis Ch. Paschalidis
Original format: PDF

