首页> 外文期刊>ETRI journal >Imbalanced SVM‐Based Anomaly Detection Algorithm for?Imbalanced Training Datasets
【24h】

Imbalanced SVM‐Based Anomaly Detection Algorithm for?Imbalanced Training Datasets

机译:基于不平衡 SVM 的异常检测算法,用于不平衡训练数据集

获取原文
           

摘要

Abnormal samples are usually difficult to obtain in production systems, resulting in imbalanced training sample sets. Namely, the number of positive samples is far less than the number of negative samples. Traditional Support Vector Machine ( SVM )‐based anomaly detection algorithms perform poorly for highly imbalanced datasets: the learned classification hyperplane skews toward the positive samples, resulting in a high false‐negative rate. This article proposes a new imbalanced SVM (termed Im SVM )‐based anomaly detection algorithm, which assigns a different weight for each positive support vector in the decision function. Im SVM adjusts the learned classification hyperplane to make the decision function achieve a maximum GM ean measure value on the dataset. The above problem is converted into an unconstrained optimization problem to search the optimal weight vector. Experiments are carried out on both Cloud datasets and Knowledge Discovery and Data Mining datasets to evaluate Im SVM . Highly imbalanced training sample sets are constructed. The experimental results show that Im SVM outperforms over‐sampling techniques and several existing imbalanced SVM ‐based techniques.
机译:通常很难在生产系统中获取异常样本,从而导致训练样本集不平衡。即,阳性样品的数量远小于阴性样品的数量。传统的基于支持向量机(SVM)的异常检测算法在高度不平衡的数据集上表现不佳:学习到的分类超平面偏向正样本,从而导致较高的假阴性率。本文提出了一种新的基于不平衡SVM(称为Im SVM)的异常检测算法,该算法为决策函数中的每个正支持向量分配了不同的权重。 Im SVM调整学习到的分类超平面,以使决策函数在数据集上达到最大GM ean度量值。将上述问题转换为无约束的优化问题,以搜索最佳权向量。在Cloud数据集以及Knowledge Discovery和Data Mining数据集上都进行了实验,以评估Im SVM。构建高度不平衡的训练样本集。实验结果表明,Im SVM优于过采样技术和几种现有的基于SVM的不平衡技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号