Imbalanced <fc>SVM</fc>‐Based Anomaly Detection Algorithm for?Imbalanced Training Datasets

GuiPing Wang; JianXi Yang; Ren Li

首页> 外文期刊>ETRI journal >Imbalanced SVM‐Based Anomaly Detection Algorithm for?Imbalanced Training Datasets

【24h】

Imbalanced SVM‐Based Anomaly Detection Algorithm for?Imbalanced Training Datasets

机译：基于不平衡 SVM 的异常检测算法，用于不平衡训练数据集

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Abnormal samples are usually difficult to obtain in production systems, resulting in imbalanced training sample sets. Namely, the number of positive samples is far less than the number of negative samples. Traditional Support Vector Machine ( SVM )‐based anomaly detection algorithms perform poorly for highly imbalanced datasets: the learned classification hyperplane skews toward the positive samples, resulting in a high false‐negative rate. This article proposes a new imbalanced SVM (termed Im SVM )‐based anomaly detection algorithm, which assigns a different weight for each positive support vector in the decision function. Im SVM adjusts the learned classification hyperplane to make the decision function achieve a maximum GM ean measure value on the dataset. The above problem is converted into an unconstrained optimization problem to search the optimal weight vector. Experiments are carried out on both Cloud datasets and Knowledge Discovery and Data Mining datasets to evaluate Im SVM . Highly imbalanced training sample sets are constructed. The experimental results show that Im SVM outperforms over‐sampling techniques and several existing imbalanced SVM ‐based techniques.

机译：通常很难在生产系统中获取异常样本，从而导致训练样本集不平衡。即，阳性样品的数量远小于阴性样品的数量。传统的基于支持向量机（SVM）的异常检测算法在高度不平衡的数据集上表现不佳：学习到的分类超平面偏向正样本，从而导致较高的假阴性率。本文提出了一种新的基于不平衡SVM（称为Im SVM）的异常检测算法，该算法为决策函数中的每个正支持向量分配了不同的权重。 Im SVM调整学习到的分类超平面，以使决策函数在数据集上达到最大GM ean度量值。将上述问题转换为无约束的优化问题，以搜索最佳权向量。在Cloud数据集以及Knowledge Discovery和Data Mining数据集上都进行了实验，以评估Im SVM。构建高度不平衡的训练样本集。实验结果表明，Im SVM优于过采样技术和几种现有的基于SVM的不平衡技术。

著录项

来源
《ETRI journal》 |2017年第5期|共11页
作者
GuiPing Wang; JianXi Yang; Ren Li;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Predicting Extreme Financial Risks on Imbalanced Dataset: A Combined Kernel FCM and Kernel SMOTE Based SVM Classifier [J] . Huang Xun, Zhang Cheng-Zhao, Yuan Jia Computational economics . 2020,第1期

机译：预测Imbalyded DataSet上的极端金融风险：基于SVM分类器的组合内核FCM和内核尺
2. RNN-Based online anomaly detection in nuclear reactors for highly imbalanced datasets with uncertainty [J] . Kim Minhee, Ou Elisa, Loh Po-Ling, Nuclear Engineering and Design . 2020,第Auga期

机译：基于RNN的在线异常检测核反应堆，用于具有不确定性的高度不平衡数据集
3. Boosted Near-miss Under-sampling on SVM ensembles for concept detection in large-scale imbalanced datasets [J] . Bao Lei, Juan Cao, Li Jintao, Neurocomputing . 2016,第JANa8期

机译：支持SVM集成的增强型近缺失欠采样，用于大规模不平衡数据集中的概念检测
4. An Effective Parallel SVM Intrusion Detection Model for Imbalanced Training Datasets [C] . Jing Zhao, Jun Li, Chun Long, International Conference on Enterprise Information Systems . 2020

机译：用于不平衡训练数据集的有效并行SVM入侵检测模型
5. Active learning with support vector machines for imbalanced datasets and a method for stopping active learning based on stabilizing predictions. [D] . Bloodgood, Michael. 2009

机译：支持向量机用于不平衡数据集的主动学习，以及一种基于稳定预测的主动学习停止方法。
6. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification [O] . Jinyan Li, Simon Fong, Yunsick Sung, 2016

机译：生物医学数据分类中基于二元不平衡数据集的自适应群聚动态多目标综合少数抽样技术算法
7. Voice authentication based on the Russian-language dataset, MFCC method and the anomaly detection algorithm [O] . Anna Sidorova, Konstantin Kogos 2020

机译：基于俄语数据集的语音认证，MFCC方法和异常检测算法

Imbalanced SVM‐Based Anomaly Detection Algorithm for?Imbalanced Training Datasets

摘要

著录项

相似文献

相关主题

期刊订阅