Source: JMLR: Workshop and Conference Proceedings

Adaptive Sampled Softmax with Kernel Based Sampling



Abstract

Softmax is the most commonly used output function for multiclass problems and is widely used in areas such as vision, natural language processing, and recommendation. A softmax model has linear costs in the number of classes which makes it too expensive for many real-world problems. A common approach to speed up training involves sampling only some of the classes at each training step. It is known that this method is biased and that the bias increases the more the sampling distribution deviates from the output distribution. Nevertheless, almost all recent work uses simple sampling distributions that require a large sample size to mitigate the bias. In this work, we propose a new class of kernel based sampling methods and develop an efficient sampling algorithm. Kernel based sampling adapts to the model as it is trained, thus resulting in low bias. It can also be easily applied to many models because it relies only on the model’s last hidden layer. We empirically study the trade-off of bias, sampling distribution and sample size and show that kernel based sampling results in low bias with few samples.
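To make the idea concrete, here is a minimal sketch of a sampled-softmax loss for a single example, using a quadratic-kernel proposal q(y) ∝ (h·w_y)² over the last hidden layer h, together with the standard logit correction logit − log q(y) that keeps the gradient close to unbiased. This is an illustrative sketch only: the function name and the brute-force computation of q are my own, not the paper's efficient divide-and-conquer sampler.

```python
import numpy as np

rng = np.random.default_rng(0)

def sampled_softmax_loss(h, W, target, m, rng):
    """Sampled-softmax loss for one example (illustrative sketch).

    h      : last hidden layer, shape (d,)
    W      : output embeddings, shape (num_classes, d)
    target : index of the true class
    m      : number of sampled negative classes
    """
    logits = W @ h                     # o_y = h . w_y for every class
    q = logits ** 2                    # quadratic kernel (h . w_y)^2
    q = q / q.sum()                    # normalize into a proposal distribution
    # NOTE: computing q exactly is O(num_classes); the paper's kernel
    # sampler avoids this full pass -- this sketch does not.
    neg = rng.choice(len(q), size=m, replace=False, p=q)
    idx = np.concatenate(([target], neg[neg != target]))
    # logit correction: subtract log q(y) so the sampled softmax
    # approximates the full softmax gradient with low bias
    adj = logits[idx] - np.log(q[idx])
    adj -= adj.max()                   # shift for numerical stability
    p = np.exp(adj) / np.exp(adj).sum()
    return -np.log(p[0])               # target class sits at position 0
```

Because q is recomputed from the current W and h, the proposal adapts as the model trains, which is the property the abstract credits for the low bias at small sample sizes.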
