IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

A Model of Double Descent for High-Dimensional Logistic Regression



Abstract

We consider a model for logistic regression in which only a subset of features of size p is used to train a linear classifier on n training samples. The classifier is obtained by running gradient descent (GD) on the logistic loss. For this model, we investigate the dependence of the classification error on the overparameterization ratio κ = p/n. First, building on known deterministic results on convergence properties of GD, we uncover a phase-transition phenomenon for the case of Gaussian features: the classification error of GD is the same as that of the maximum-likelihood (ML) solution when κ < κ⋆, and the same as that of the max-margin (SVM) solution when κ > κ⋆. Next, using the convex Gaussian min-max theorem (CGMT), we sharply characterize the performance of both the ML and SVM solutions. Combining these results, we obtain curves that explicitly characterize the test error of GD for varying values of κ. The numerical results validate the theoretical predictions and unveil "double-descent" phenomena that complement similar recent observations in linear regression settings.
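The setup described above can be sketched numerically. The following is a minimal illustration, not the paper's exact experiment: the dimensions, step size, number of iterations, and the Gaussian data model with a logistic link are all assumptions chosen for a quick simulation. It trains a linear classifier by full-batch GD on the logistic loss using only the first p of d features, then estimates the test error, so that sweeping p traces the error as a function of κ = p/n.

```python
import numpy as np

def gd_logistic_test_error(p, n=100, d=200, n_test=2000,
                           steps=3000, lr=0.5, seed=0):
    """Train a linear classifier on the first p of d Gaussian features
    by full-batch gradient descent on the logistic loss, and return
    its classification error on a fresh test set.

    All default sizes are illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    beta = rng.standard_normal(d) / np.sqrt(d)      # ground-truth weights

    def sample(m):
        X = rng.standard_normal((m, d))             # i.i.d. Gaussian features
        prob = 1.0 / (1.0 + np.exp(-X @ beta))      # logistic link
        y = np.where(rng.random(m) < prob, 1.0, -1.0)
        return X, y

    X, y = sample(n)
    X_test, y_test = sample(n_test)

    w = np.zeros(p)
    Xp = X[:, :p]                                   # classifier sees only p features
    for _ in range(steps):
        margins = np.clip(y * (Xp @ w), -50.0, 50.0)
        # gradient of mean log(1 + exp(-y * x.w)):
        #   -mean( y * x / (1 + exp(y * x.w)) )
        grad = -(Xp * (y / (1.0 + np.exp(margins)))[:, None]).mean(axis=0)
        w -= lr * grad

    preds = np.sign(X_test[:, :p] @ w)
    return float(np.mean(preds != y_test))

# Sweep the overparameterization ratio kappa = p/n across the
# underparameterized (p < n) and overparameterized (p > n) regimes.
errors = {p / 100: gd_logistic_test_error(p) for p in (20, 80, 120, 180)}
```

Plotting `errors` against κ for a finer sweep (and averaging over seeds) is how one would visualize the double-descent curve that the theory characterizes.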

