Fast SVM training using data reconstruction for classification of very large datasets

首页> 外文期刊>IEEJ Transactions on Electrical and Electronic Engineering >Fast SVM training using data reconstruction for classification of very large datasets

【24h】

Fast SVM training using data reconstruction for classification of very large datasets

机译：使用数据重建进行快速SVM培训，以分类非常大的数据集

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Abstract This paper proposes a fast support vector machine (SVM) training method for the classification of very large datasets using data reconstruction. The idea is to scale down the training data by removing the samples that have low probability to become support vectors (SVs) in the feature space. For this purpose, it applies a series of gradually refined rough SVM classifiers with a quasi‐linear kernel to build rough separation boundaries and remove those samples that are far away from the boundary. In order to make the proposed algorithm efficient for both low‐dimensional and high‐dimensional datasets, efforts are made on three aspects. The first one is to compose a quasi‐linear kernel using the information of data manifold and potential separation boundary such that the samples mapped to feature space keep a sparse distribution, especially in the direction perpendicular to the separation boundary. The second one is to avoid computing Euclidean distances between samples, which may lose its effect on very high dimensional datasets when mapping the samples to feature space and selecting the samples for training data reconstruction. The third one is to design a sophisticated iterative algorithm to gradually refine the rough SVM classifier so as to remove non‐SVs efficiently. The proposed fast SVM training method is applied to different real‐world large datasets and compared with different methods, and simulation results confirm the effectiveness of the proposed method, especially for very high dimensional datasets. © 2019 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.

机译：Abstract 本文提出了一种快速支持向量机（SVM）训练方法，用于分类非常大的使用数据重建的数据集。这个想法是通过删除在特征空间中成为支持向量（SVS）的样本来缩小训练数据的规模。为此，它应用了一系列带有准线性内核的逐渐完善的粗糙SVM分类器来构建粗糙的分离边界，并删除那些远离边界的样品。为了使所提出的算法有效地为低维和高维数据集提供了努力，在三个方面进行了努力。第一个是使用数据歧管和电势分离边界的信息组成准线性内核，以便映射到特征空间的样品保持稀疏分布，尤其是沿垂直于分离边界的方向。第二个是避免计算样品之间的欧几里得距离，在映射样品以配备空间并选择样品以训练数据重建时，这可能会失去对非常高维数据集的影响。第三个是设计一种复杂的迭代算法，以逐步完善粗糙的SVM分类器，从而有效地删除非SVS。提出的快速SVM训练方法应用于不同的实际世界大数据集，并与不同的方法进行了比较，并且模拟结果证实了该方法的有效性，尤其是对于非常高维数据集的有效性。 ©2019日本电气工程师研究所。由John Wiley＆amp出版Sons，Inc。

著录项

来源
《IEEJ Transactions on Electrical and Electronic Engineering》 |2020年第3期|372-381|共10页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类 E33;
关键词
boundary layer; Eigenvector; Taxonomydata reconstructionTraining methodsSupport Vector NetworkReconstructive Surgical ProceduresDataset;

机译：边界层;特征向量;分类学重建培养方法support vector network recstructive手术程序dataset;

相似文献

外文文献
中文文献
专利

1. Fuzzy integral-based ELM ensemble for imbalanced big data classification [J] . Junhai Zhai, Sufang Zhang, Mingyang Zhang, Soft computing: A fusion of foundations, methodologies and applications . 2018,第11期

机译：基于模糊的基于积分的ELM合奏，用于Big Data Classification
2. A Novel Semi-supervised SVM Based on Tri-training for Intrusition Detection [J] . Jimin Li1, 2, Wei Zhang2, Journal of Computers . 2010,第4期

机译：一种基于Tri-Training进行侵袭检测的新型半监控SVM
3. Performance analysis of sentiments in Twitter dataset using SVM models [J] . Lakshmana Kumar Ramasamy, Seifedine Kadry, Yunyoung Nam, International Journal of Electrical and Computer Engineering . 2021,第3期

机译：使用SVM模型的Twitter DataSet情绪分析
4. Introducing a 'Difficulty Factor' for Dataset Bi-classification using SVM [C] . Luis GONZALEZ-ABRIL, Cecilio ANGULO International Conference of the Catalan Association for Artificial Intelligence . 2013

机译：使用SVM引入DataSet Bi-Classification的“难度因子”
5. Supplemental Information 9: Code SIPAKMED Dataset Multiclass Classification VGG-19. [O] . -1

机译：补充信息9：代码SIPAKMED DataSet Multiclass Classification VGG-19。

Fast SVM training using data reconstruction for classification of very large datasets

摘要

著录项

相似文献

相关主题

期刊订阅