A noise-detection based AdaBoost algorithm for mislabeled data

Cao J.; Kwong S.; Wang R.

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A noise-detection based AdaBoost algorithm for mislabeled data

【24h】

A noise-detection based AdaBoost algorithm for mislabeled data

机译：一种基于噪声检测的AdaBoost算法，用于标签数据错误

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Noise sensitivity is known as a key related issue of AdaBoost algorithm. Previous works exhibit that AdaBoost is prone to be overfitting in dealing with the noisy data sets due to its consistent high weights assignment on hard-to-learn instances (mislabeled instances or outliers). In this paper, a new boosting approach, named noise-detection based AdaBoost (ND-AdaBoost), is exploited to combine classifiers by emphasizing on training misclassified noisy instances and correctly classified non-noisy instances. Specifically, the algorithm is designed by integrating a noise-detection based loss function into AdaBoost to adjust the weight distribution at each iteration. A k-nearest-neighbor (k-NN) and an expectation maximization (EM) based evaluation criteria are both constructed to detect noisy instances. Further, a regeneration condition is presented and analyzed to control the ensemble training error bound of the proposed algorithm which provides theoretical support. Finally, we conduct some experiments on selected binary UCI benchmark data sets and demonstrate that the proposed algorithm is more robust than standard and other types of AdaBoost for noisy data sets.

机译：噪声敏感度是AdaBoost算法的一个关键相关问题。以前的工作表明，由于AdaBoost在难以学习的实例（标记错误的实例或异常值）上始终具有较高的权重分配，因此在处理嘈杂的数据集时倾向于过度拟合。在本文中，一种新的增强方法被称为基于噪声检测的AdaBoost（ND-AdaBoost），它通过强调训练错误分类的有噪声实例和正确分类的无噪声实例来组合分类器。具体而言，通过将基于噪声检测的损失函数集成到AdaBoost中以在每次迭代中调整权重分布来设计算法。构造了一个k近邻（k-NN）和一个基于期望最大化（EM）的评估标准来检测嘈杂的实例。此外，提出并分析了再生条件，以控制所提出算法的整体训练误差范围，为理论提供了支持。最后，我们对选定的UCI二进制基准数据集进行了一些实验，并证明了该算法比标准和其他类型的AdaBoost噪声数据集更健壮。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2012年第12期|共15页
作者
Cao J.; Kwong S.; Wang R.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
AdaBoost; EM; Ensemble learning; k-NN; Pattern recognition;

机译：AdaBoost;EM;集成学习;k-NN;模式识别;

相似文献

外文文献
中文文献
专利

1. A noise-detection based AdaBoost algorithm for mislabeled data [J] . Cao J., Kwong S., Wang R. Pattern Recognition: The Journal of the Pattern Recognition Society . 2012,第12期

机译：一种基于噪声检测的AdaBoost算法，用于标签数据错误
2. Acoustic Seabed Classification Based on Multibeam Echosounder Backscatter Data Using the PSO-BP-AdaBoost Algorithm: A Case Study From Jiaozhou Bay, China [J] . Ji Xue, Yang Bisheng, Tang Qiuhua IEEE Journal of Oceanic Engineering . 2021,第2期

机译：基于Multibeam Echosounder反向散射数据的声学海底分类使用PSO-BP-Adaboost算法：胶州湾，中国的案例研究
3. Temperature Field Online Reconstruction for In-Service Concrete Arch Dam Based on Limited Temperature Observation Data Using AdaBoost-ANN Algorithm [J] . Zhuoyan Chen, Dongjian Zheng, Jiqiong Li, Mathematical Problems in Engineering: Theory, Methods and Applications . 2021,第a期

机译：使用Adaboost-Ann算法基于有限温度观测数据的役混凝土拱坝在线重建
4. Research on imbalanced data : based on SMOTE-AdaBoost algorithm [C] . Mengyu Lv, Yi Ren, Yufen Chen International Conference on Electronic Information Technology and Computer Engineering . 2019

机译：数据不平衡研究：基于SMOTE-AdaBoost算法
5. A near real-time, highly scalable, parallel and distributed adaptive object detection and re-training framework based on the AdaBoost algorithm [D] . Abualkibash, Munther 2015

机译：基于AdaBoost算法的近实时，高度可扩展，并行和分布式的自适应对象检测和再训练框架
6. Adaboost face detector based on Joint Integral Histogram and Genetic Algorithms for feature extraction process [O] . Ameni Yangui Jammoussi, Sameh Fakhfakh Ghribi, Dorra Sellami Masmoudi -1

机译：基于联合积分直方图和遗传算法的Adaboost人脸检测器特征提取过程
7. Combining adaboost with preprocessing algorithms for extracting fuzzy rules from low quality data in possibly imbalanced problems [O] . Palacios Jiménez Ana María, Sánchez Ramos Luciano, Couso Blanco Inés 2012

机译：将adaboost与预处理算法结合使用，以从可能不平衡的问题中从低质量数据中提取模糊规则

A noise-detection based AdaBoost algorithm for mislabeled data

摘要

著录项

相似文献

相关主题

期刊订阅