【24h】

Streamed Learning: One-Pass SVMs

机译:流学习:单通机SVMS

获取原文

摘要

We present a streaming model for large-scale classification (in the context of e_2-SVM) by leveraging connections between learning and computational geometry. The streaming model imposes the constraint that only a single pass over the data is allowed. The e_2-SVM is known to have an equivalent formulation in terms of the minimum enclosing ball (MEB) problem, and an efficient algorithm based on the idea of core sets exists (CVM) [Tsang et al, 2005]. CVM learns a (1 + ε)-approximate MEB for a set of points and yields an approximate solution to corresponding SVM instance. However CVM works in batch mode requiring multiple passes over the data. This paper presents a single-pass SVM which is based on the minimum enclosing ball of streaming data. We show that the MEB updates for the streaming case can be easily adapted to learn the SVM weight vector in a way similar to using online stochastic gradient updates. Our algorithm performs polylogarithmic computation at each example, and requires very small and constant storage. Experimental results show that, even in such restrictive settings, we can learn efficiently in just one pass and get accuracies comparable to other state-of-the-art SVM solvers (batch and online). We also give an analysis of the algorithm, and discuss some open issues and possible extensions.
机译:我们通过利用学习和计算几何之间的连接来提供大规模分类的流模型(在E_2-SVM的上下文中)。流模型施加了仅允许单次通过数据的约束。已知E_2-SVM在最小封闭球(MEB)问题方面具有等效的制剂,并且基于核心集的思想存在的有效算法存在(CVM)[Tsang等,2005]。 CVM了解一组点的(1 +ε) - 批长MEB,并对相应的SVM实例产生近似解。然而,CVM在批处理模式下工作,需要多次通过数据。本文介绍了一个单通式SVM,基于流数据的最小封闭球。我们表明流箱的MEB更新可以很容易地适于以类似于使用在线随机梯度更新的方式学习SVM权重向量。我们的算法在每个示例下执行PolyGarithmic计算,并且需要非常小而恒定的存储。实验结果表明,即使在这种限制性设置中,我们也可以在一次通过的情况下高效学习,并获得与其他最先进的SVM求解器(批量和在线)相当的准确性。我们还对算法进行了分析,并讨论了一些开放问题和可能的扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号