Barricaded Boundary Minority Oversampling LS-SVM for a Biased Binary Classification

Abstract

Classifying biased datasets with linearly non-separable features has been a challenge in pattern recognition because traditional classifiers, which are usually skewed towards the majority class, often produce sub-optimal results. If biased or unbalanced data is not processed appropriately, any information extracted from such data risks being compromised. Least Squares Support Vector Machines (LS-SVM) is known for its computational advantage over SVM; however, it suffers from a lack of sparsity in the support vectors: it learns the separating hyper-plane from the whole dataset and often produces biased hyper-planes on imbalanced datasets. Motivated to contribute a novel approach for the supervised classification of imbalanced datasets, we propose Barricaded Boundary Minority Oversampling (BBMO), which oversamples the minority samples at the boundary in the direction of the closest majority samples to remove LS-SVM's bias due to data imbalance. Two variations of BBMO are studied: BBMO1 for the linearly separable case, which uses the Lagrange multipliers to extract boundary samples from both classes, and the generalized BBMO2 for the nonlinear case, which uses the kernel matrix to extract the closest majority samples to each minority sample. In either case, BBMO computes the weighted means as new synthetic minority samples and appends them to the dataset. Experiments on different synthetic and real-world datasets show that BBMO with LS-SVM improved on other methods in the literature and motivate follow-on research.
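
The abstract gives no formulas, so the following is only a minimal sketch of the idea behind the nonlinear variant, not the authors' implementation. It assumes a Gaussian (RBF) kernel, a single closest majority sample per minority sample, and a fixed mixing weight of 0.75 for the weighted mean; the function names (`rbf_kernel`, `oversample_toward_majority`, `lssvm_fit`, `lssvm_predict`) and all parameter values are hypothetical. The LS-SVM part uses the standard linear-system formulation with labels in {-1, +1}, which also illustrates the lack of sparsity: every training sample receives its own multiplier.

```python
import numpy as np


def rbf_kernel(A, B, gamma=1.0):
    """Gaussian kernel matrix K[i, j] = exp(-gamma * ||A[i] - B[j]||^2)."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def oversample_toward_majority(X, y, minority=1, weight=0.75, gamma=1.0):
    """Append one synthetic minority sample per original minority sample.

    Assumption: `weight` (the share given to the minority sample) and the
    one-closest-majority rule are illustrative choices, not the paper's.
    """
    X_min, X_maj = X[y == minority], X[y != minority]
    K = rbf_kernel(X_min, X_maj, gamma)
    # For an RBF kernel, the largest kernel value marks the closest sample.
    nearest = np.argmax(K, axis=1)
    # Weighted mean of each minority sample and its closest majority sample.
    synthetic = weight * X_min + (1.0 - weight) * X_maj[nearest]
    X_new = np.vstack([X, synthetic])
    y_new = np.concatenate([y, np.full(len(synthetic), minority, dtype=y.dtype)])
    return X_new, y_new


def lssvm_fit(X, y, C=10.0, gamma=1.0):
    """Solve the standard LS-SVM classifier linear system for (alpha, b)."""
    n = len(y)
    Omega = np.outer(y, y) * rbf_kernel(X, X, gamma)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = y
    A[1:, 0] = y
    A[1:, 1:] = Omega + np.eye(n) / C
    rhs = np.concatenate([[0.0], np.ones(n)])
    sol = np.linalg.solve(A, rhs)
    return sol[1:], sol[0]  # one multiplier per training sample, plus the bias


def lssvm_predict(X_train, y_train, alpha, b, X_test, gamma=1.0):
    """Sign of the LS-SVM decision function evaluated at the test points."""
    K = rbf_kernel(X_test, X_train, gamma)
    return np.sign(K @ (alpha * y_train) + b)


# Toy imbalanced data (made up for illustration): 100 majority, 10 minority.
rng = np.random.default_rng(0)
X_demo = np.vstack([rng.normal(0.0, 1.0, (100, 2)), rng.normal(2.5, 1.0, (10, 2))])
y_demo = np.concatenate([-np.ones(100), np.ones(10)])

X_aug, y_aug = oversample_toward_majority(X_demo, y_demo)
alpha_demo, b_demo = lssvm_fit(X_aug, y_aug)
labels = lssvm_predict(X_aug, y_aug, alpha_demo, b_demo, X_demo)
```

A weight above 0.5 keeps each synthetic point closer to its minority source than to the majority sample it moves towards; how the paper actually chooses the weights and the number of closest majority samples is not stated in the abstract.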

Bibliographic record

Source
Discovery Science | 2018 | pp. 18-32 | 15 pages
Conference location: Limassol (CY)
Authors
Hmayag Partamian; Yara Rizk; Mariette Awad
Author affiliations

Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon;

Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon;

Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon;

Original format: PDF
Language: English
Keywords
Biased datasets; Linearly separable features; Weighted means; Barricaded boundary minority oversampling; Kernel matrix;
