IEEE Transactions on Neural Networks

A Kernel-Based Two-Class Classifier for Imbalanced Data Sets


Abstract

Many kernel classifier construction algorithms adopt classification accuracy as the performance metric for model evaluation. Moreover, equal weighting is often applied to each data sample in parameter estimation. These modeling practices often become problematic if the data sets are imbalanced. We present a kernel classifier construction algorithm using orthogonal forward selection (OFS) to optimize model generalization for imbalanced two-class data sets. The kernel classifier identification algorithm is based on a new regularized orthogonal weighted least squares (ROWLS) estimator and a model selection criterion of maximal leave-one-out area under the receiver operating characteristic (ROC) curve (LOO-AUC). It is shown that, owing to the orthogonalization procedure, the LOO-AUC can be calculated via an analytic formula based on the new ROWLS parameter estimator, without actually splitting the estimation data set. The proposed algorithm achieves minimal computational expense via a set of forward recursive updating formulas when searching for model terms with maximal incremental LOO-AUC value. Numerical examples demonstrate the efficacy of the algorithm.
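The two core ideas of the abstract, using AUC rather than accuracy as the selection criterion and growing the kernel model by greedy forward selection of terms, can be illustrated with a simplified sketch. This is not the paper's algorithm: it uses plain training-set AUC instead of the analytic LOO-AUC formula, ordinary regularized least squares instead of ROWLS, and no orthogonalization; the function names, the Gaussian kernel, its width, and the regularization constant are all illustrative assumptions.

```python
import numpy as np

def auc(scores, labels):
    """Rank-based AUC (Mann-Whitney statistic); labels are 0/1."""
    order = np.argsort(scores)
    ranks = np.empty(len(scores))
    ranks[order] = np.arange(1, len(scores) + 1)
    pos = labels == 1
    n_pos, n_neg = pos.sum(), (~pos).sum()
    return (ranks[pos].sum() - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

def forward_select_kernel(X, y, width=1.0, lam=1e-3, n_terms=5):
    """Greedy forward selection of Gaussian-kernel centers.

    At each step, add the candidate center whose inclusion gives the
    largest AUC after a regularized least-squares refit; stop when no
    candidate improves the AUC or n_terms centers are chosen.
    """
    # Gaussian kernel matrix: every training sample is a candidate center.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    K = np.exp(-d2 / (2 * width ** 2))

    selected, score = [], -np.inf
    for _ in range(n_terms):
        best = None
        for j in range(K.shape[1]):
            if j in selected:
                continue
            Phi = K[:, selected + [j]]
            # Regularized least squares (stand-in for the paper's ROWLS).
            w = np.linalg.solve(Phi.T @ Phi + lam * np.eye(Phi.shape[1]),
                                Phi.T @ y)
            s = auc(Phi @ w, y)
            if best is None or s > best[0]:
                best = (s, j)
        if best[0] <= score:  # no incremental AUC gain: stop
            break
        score, j = best
        selected.append(j)
    return selected, score
```

On an imbalanced toy set (e.g. 40 negatives, 8 positives), a few selected centers typically suffice to push the AUC well above the 0.5 chance level, which is the point of optimizing AUC directly instead of accuracy on such data.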
