...
首页> 外文期刊>Neurocomputing >PDFOS: PDF estimation based over-sampling for imbalanced two-class problems
【24h】

PDFOS: PDF estimation based over-sampling for imbalanced two-class problems

机译:PDFOS:针对不平衡的两类问题,基于PDF估计的过采样

获取原文
获取原文并翻译 | 示例
           

摘要

This contribution proposes a novel probability density function (PDF) estimation based over-sampling (PDFOS) approach for two-class imbalanced classification problems. The classical Parzen-window kernel function is adopted to estimate the PDF of the positive class. Then according to the estimated PDF, synthetic instances are generated as the additional training data. The essential concept is to re-balance the class distribution of the original imbalanced data set under the principle that synthetic data sample follows the same statistical properties. Based on the over-sampled training data, the radial basis function (RBF) classifier is constructed by applying the orthogonal forward selection procedure, in which the classifier's structure and the parameters of RBF kernels are determined using a particle swarm optimisation algorithm based on the criterion of minimising the leave-one-out misclassification rate. The effectiveness of the proposed PDFOS approach is demonstrated by the empirical study on several imbalanced data sets.
机译:此贡献提出了一种新颖的基于概率密度函数(PDF)估计的过采样(PDFOS)方法,用于两类不平衡分类问题。采用经典的Parzen-window核函数来估计正类的PDF。然后根据估计的PDF,生成合成实例作为附加训练数据。基本概念是在合成数据样本遵循相同统计属性的原则下,重新平衡原始不平衡数据集的类分布。基于过度采样的训练数据,通过应用正交正向选择程序构造径向基函数(RBF)分类器,其中,基于该准则的粒子群优化算法确定分类器的结构和RBF核的参数最大限度地减少遗漏的错误分类率。对几种不平衡数据集的实证研究证明了所提出的PDFOS方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号