首页> 外文会议>Iberoamerican Congress on Pattern Recognition >Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm

【24h】

Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm

机译：管理多标签问题中的不平衡数据集：用粉碎算法进行案例研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-label learning has been becoming an increasingly active area into the machine learning community since a wide variety of real world problems are naturally multi-labeled. However, it is not uncommon to find disparities among the number of samples of each class, which constitutes an additional challenge for the learning algorithm. SMOTE is an oversampling technique that has been successfully applied for balancing single-labeled data sets, but has not been used in multi-label frameworks so far. In this work, several strategies are proposed and compared in order to generate synthetic samples for balancing data sets in the training of multi-label algorithms. Results show that a correct selection of seed samples for oversampling improves the classification performance of multi-label algorithms. The uniform generation oversampling, provides an efficient methodology for a wide scope of real world problems.

机译：自从各种各样的现实世界问题自然是多标记的，多标签学习已成为机器学习界的越来越活跃的地区。然而，在每个类的样本数量之间发现差异并不罕见，这构成了学习算法的额外挑战。 Smote是一种过采样技术，已成功应用于平衡单标签数据集，但到目前为止还没有用于多标签框架。在这项工作中，提出了几种策略，并比较了为在多标签算法训练中产生用于平衡数据集的合成样本。结果表明，用于过采样的正确选择采样改善了多标签算法的分类性能。统一一代过采样，为广泛的现实世界问题提供了有效的方法。

著录项

来源
《Iberoamerican Congress on Pattern Recognition 》|2013年||共9页
会议地点
作者
Andres Felipe Giraldo-Forero; Jorge Alberto Jaramillo-Garzon; Jose Francisco Ruiz-Munoz; Cesar German Castellanos-Dominguez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.4-53;
关键词

相似文献

外文文献
中文文献
专利

1. SMOTE-RSB_*: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory [J] . Enislay Ramentol, Yaile Caballero, Rafael Bello, Knowledge and information systems . 2012 ,第2期

机译：SMOTE-RSB_ *：使用SMOTE和粗糙集理论的基于过采样和欠采样的混合预处理方法，用于高不平衡数据集
2. SMOTE-RSB *: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory [J] . Enislay Ramentol, Yailé Caballero, Rafael Bello, Knowledge and Information Systems . 2012 ,第2期

机译：SMOTE-RSB * ：一种基于过采样和欠采样的混合预处理方法，使用SMOTE和粗糙集理论处理高不平衡数据集
3. ENHANCING THE PERFORMANCE OF SMOTE ALGORITHM BY USING ATTRIBUTE WEIGHTING SCHEME AND NEW SELECTIVE SAMPLING METHOD FOR IMBALANCED DATA SET [J] . TORA FAHRUDIN, JOKO LIANTO BULIALI, CHASTINE FATICHAH International Journal of Innovative Computing Information and Control . 2019 ,第2期

机译：使用属性加权方案和新的选择采样方法来增强Smote算法的性能
4. Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm [C] . Andres Felipe Giraldo-Forero, Jorge Alberto Jaramillo-Garzon, Jose Francisco Ruiz-Munoz, Iberoamerican congress on pattern recognition . 2013

机译：在多标签问题中管理不平衡数据集：以SMOTE算法为例
5. Online Classification Methods for Imbalance and Multi-Label Data [D] . Du, Jie. 2019

机译：不平衡和多标签数据的在线分类方法
6. Thick Slices from Tomosynthesis Data Sets: Phantom Study for the Evaluation of Different Algorithms [O] . Felix Diekmann, Henning Meyer, Susanne Diekmann, 2009

机译：层析合成数据集中的厚切片：用于评估不同算法的幻像研究
7. Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm [O] . Andrés Felipe Giraldo-Forero, Jorge Alberto Jaramillo-Garzón, José Francisco Ruiz-Muñoz, 2013

机译：管理多标签问题中的不平衡数据集：用粉碎算法进行案例研究

Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅