Annual Conference on Neural Information Processing Systems

Adaptive Newton Method for Empirical Risk Minimization to Statistical Accuracy

Abstract

We consider empirical risk minimization for large-scale datasets. We introduce Ada Newton as an adaptive algorithm that uses Newton's method with adaptive sample sizes. The main idea of Ada Newton is to increase the size of the training set by a factor larger than one, such that the minimization variable for the current training set lies in the local neighborhood of the optimal argument of the next training set. This allows us to exploit the quadratic convergence property of Newton's method and reach the statistical accuracy of each training set with only one Newton iteration. We show theoretically that we can iteratively increase the sample size while applying single Newton iterations without line search and staying within the statistical accuracy of the regularized empirical risk. In particular, we can double the size of the training set in each iteration when the number of samples is sufficiently large. Numerical experiments on various datasets confirm that the sample size can be increased by a factor of 2 at each iteration, which implies that Ada Newton achieves the statistical accuracy of the full training set with about two passes over the dataset.
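
To make the adaptive sample-size scheme concrete, the following is a minimal sketch of one possible implementation, assuming a regularized logistic-regression objective; the function name ada_newton, its parameters, and the loss choice are illustrative and not taken from the paper.

```python
import numpy as np

def ada_newton(X, y, w0, n0, lam=1e-4, growth=2.0):
    """Sketch of Newton's method with adaptive sample sizes.

    Assumes w0 already solves the problem on the first n0 samples to
    within its statistical accuracy. Each round enlarges the training
    set by `growth` and takes a single Newton step with no line search.
    (Illustrative sketch for a regularized logistic loss, not the
    authors' reference implementation.)
    """
    N = X.shape[0]
    n, w = n0, w0.copy()
    while n < N:
        n = min(int(growth * n), N)           # grow the training set
        Xn, yn = X[:n], y[:n]                 # labels yn in {-1, +1}
        z = yn * (Xn @ w)                     # margins y_i x_i^T w
        p = 1.0 / (1.0 + np.exp(-z))          # sigmoid of the margins
        # Gradient and Hessian of the regularized empirical risk on n samples
        g = Xn.T @ ((p - 1.0) * yn) / n + lam * w
        H = (Xn.T * (p * (1.0 - p))) @ Xn / n + lam * np.eye(len(w))
        w = w - np.linalg.solve(H, g)         # one Newton step, no line search
    return w
```

With growth=2.0 the sketch matches the regime described above: the training set doubles at each stage and a single Newton step per stage suffices, so the total work amounts to roughly two passes over the full dataset.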
