IEEE International Symposium on Information Theory

Analytic Study of Double Descent in Binary Classification: The Impact of Loss

Abstract

Extensive empirical evidence reveals that, for a wide range of different learning methods and data sets, the risk curve exhibits a double-descent (DD) trend as a function of the model size. In our recent coauthored paper [Deng et al., '19], we proposed simple binary linear classification models and showed that the test error of gradient descent (GD) with logistic loss undergoes a DD. In this paper, we complement these results by extending them to GD with square loss. We show that the DD phenomenon persists, but we also identify several differences compared to logistic loss. This emphasizes that crucial features of DD curves (such as their transition threshold and global minima) depend both on the training data and on the learning algorithm. We further study the dependence of DD curves on the size of the training set. Similar to [Deng et al., '19], our results are analytic: we plot the DD curves by first deriving sharp asymptotics for the test error under Gaussian features. Albeit simple, the models permit a principled study, the outcomes of which theoretically corroborate related empirical findings occurring in more complex learning tasks.
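The paper plots its DD curves from closed-form asymptotics, but the qualitative shape is easy to reproduce numerically. Below is a minimal simulation sketch under assumptions of our own choosing (Gaussian features, noiseless labels y = sign(x'beta*), and a learner restricted to the first p of D coordinates), not the paper's exact data model; the names n, D, beta_star, and test_error are illustrative. It relies on the fact that GD on the square loss from zero initialization converges to the minimum-norm least-squares solution, which the sketch computes directly with a pseudoinverse.

```python
# Minimal numerical sketch of a double-descent curve for binary
# classification trained with square loss. Illustrative assumptions,
# NOT the paper's exact model: Gaussian features, noiseless labels
# y = sign(x . beta*), and a learner that only sees the first p
# coordinates of each sample. GD on the square loss from zero
# initialization converges to the minimum-norm least-squares
# solution, computed here directly via a pseudoinverse.
import numpy as np

rng = np.random.default_rng(0)
n, D, n_test, trials = 100, 400, 2000, 20  # train size, ambient dim (illustrative)

def test_error(p):
    """Average 0/1 test error of min-norm least squares on p features."""
    errs = []
    for _ in range(trials):
        beta_star = rng.standard_normal(D) / np.sqrt(D)  # hidden direction
        X = rng.standard_normal((n, D))
        y = np.sign(X @ beta_star)                       # binary labels
        X_test = rng.standard_normal((n_test, D))
        y_test = np.sign(X_test @ beta_star)
        # Min-norm least-squares fit on the first p features (= GD limit).
        beta_hat = np.linalg.pinv(X[:, :p]) @ y
        errs.append(np.mean(np.sign(X_test[:, :p] @ beta_hat) != y_test))
    return float(np.mean(errs))

for p in [10, 25, 50, 75, 90, 100, 110, 125, 150, 200, 300, 400]:
    print(f"p = {p:3d}  (p/n = {p / n:.2f})  test error = {test_error(p):.3f}")
```

Sweeping p through the interpolation threshold p = n should show the test error spike near p ≈ n and then descend again as p grows, the DD trend the abstract describes for square loss.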