Information Sciences: An International Journal

Scalable Gaussian Kernel Support Vector Machines with Sublinear Training Time Complexity



Abstract

Gaussian kernel Support Vector Machines (SVMs) deliver state-of-the-art generalization performance for non-linear classification, but the time complexity of their training process is at least quadratic in the size of the training set, preventing them from scaling up to large datasets. To address this issue, we propose a novel approach that trains large-scale kernel SVMs in time sublinear in the size of the training set, combining three well-known and efficient techniques with theoretical guarantees. First, we subsample the massive training set to reduce the sample size. Then, we apply the random Fourier feature mapping to the subsamples to construct an explicit random feature space, in which we can train a linear SVM to approximate the corresponding Gaussian kernel SVM. Finally, we use parallel algorithms to make our approach more scalable. By deriving upper bounds on the kernel matrix approximation error, the hypothesis error, and the excess risk with respect to the size of the training set and the dimension of the random feature space, we establish the theoretical foundation of our approach. In this way, we reduce the time complexity of training kernel SVMs without sacrificing much accuracy. The proposed approach achieves high accuracy with sublinear training time complexity, exhibiting good scalability both theoretically and empirically.
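The subsample-then-linearize pipeline described in the abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the synthetic dataset and the choices of subsample size m, kernel parameter gamma, and feature dimension D are placeholders, scikit-learn's LinearSVC stands in for whichever linear solver the paper uses, and the parallelization step is omitted.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=20000, n_features=50, random_state=0)

# Step 1: subsample the training set to reduce the sample size (m is illustrative).
m = 5000
idx = rng.choice(len(X), size=m, replace=False)
X_sub, y_sub = X[idx], y[idx]

# Step 2: explicit random Fourier feature map z(x) = sqrt(2/D) * cos(Wx + b),
# whose inner products approximate the Gaussian kernel exp(-gamma * ||x - y||^2).
gamma, D = 0.05, 1024
W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(X.shape[1], D))
b = rng.uniform(0.0, 2.0 * np.pi, size=D)

def rff(X):
    return np.sqrt(2.0 / D) * np.cos(X @ W + b)

# Step 3: train a linear SVM in the random feature space; this approximates
# the corresponding Gaussian kernel SVM at far lower training cost.
clf = LinearSVC(C=1.0).fit(rff(X_sub), y_sub)
print("accuracy on the full set:", clf.score(rff(X), y))
```

Because the linear SVM is trained only on the m subsampled points in a fixed D-dimensional feature space, the training cost no longer grows quadratically with the full training-set size, which is the source of the sublinear complexity claimed in the abstract.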
