JMLR: Workshop and Conference Proceedings

Not All Samples Are Created Equal: Deep Learning with Importance Sampling


Abstract

Deep neural network training spends most of its computation on examples that are already handled correctly and could therefore be ignored. We propose to mitigate this with a principled importance sampling scheme that focuses computation on "informative" examples and reduces the variance of the stochastic gradients during training. Our contribution is twofold: first, we derive a tractable upper bound on the per-sample gradient norm; second, we derive an estimator of the variance reduction achieved with importance sampling, which lets us switch it on only when it will yield an actual speedup. The resulting scheme can be adopted by changing a few lines of code in a standard SGD procedure, and we demonstrate experimentally on image classification, CNN fine-tuning, and RNN training that, for a fixed wall-clock time budget, it reduces the training loss by up to an order of magnitude and improves test error by a relative 5% to 17%.
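As the abstract notes, the scheme drops into a standard SGD loop with only a few changed lines. Below is a minimal PyTorch sketch of the idea, not the authors' reference implementation: it uses the per-sample loss as a cheap stand-in for the paper's gradient-norm upper bound, and the names (importance_sampled_step, x_pool, y_pool, batch_size) are hypothetical.

import torch
import torch.nn.functional as F

def importance_sampled_step(model, optimizer, x_pool, y_pool, batch_size):
    # Score a large pool of candidates with a cheap forward pass. The
    # per-sample loss here is an assumed proxy for the paper's upper
    # bound on the per-sample gradient norm.
    model.eval()
    with torch.no_grad():
        scores = F.cross_entropy(model(x_pool), y_pool, reduction="none")
    probs = scores / scores.sum()          # sampling distribution p_i

    # Draw the minibatch in proportion to the importance scores.
    idx = torch.multinomial(probs, batch_size, replacement=True)

    # Reweight each sampled example by 1 / (N * p_i) so the stochastic
    # gradient remains an unbiased estimate of the full-batch gradient.
    weights = 1.0 / (len(x_pool) * probs[idx])

    model.train()
    optimizer.zero_grad()
    losses = F.cross_entropy(model(x_pool[idx]), y_pool[idx], reduction="none")
    loss = (weights.detach() * losses).mean()
    loss.backward()
    optimizer.step()
    return loss.item()

A faithful implementation would score examples with the derived upper bound itself and use the paper's variance-reduction estimator to decide when to switch importance sampling on; the 1/(N * p_i) reweighting above is what keeps the gradient estimator unbiased under non-uniform sampling.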


