Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm

机译：在多标签问题中管理不平衡数据集：以SMOTE算法为例

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-label learning has been becoming an increasingly active area into the machine learning community since a wide variety of real world problems are naturally multi-labeled. However, it is not uncommon to find disparities among the number of samples of each class, which constitutes an additional challenge for the learning algorithm. Smote is an oversampling technique that has been successfully applied for balancing single-labeled data sets, but has not been used in multi-label frameworks so far. In this work, several strategies are proposed and compared in order to generate synthetic samples for balancing data sets in the training of multi-label algorithms. Results show that a correct selection of seed samples for oversampling improves the classification performance of multi-label algorithms. The uniform generation oversampling, provides an efficient methodology for a wide scope of real world problems.

机译：多标签学习已成为机器学习社区中一个越来越活跃的领域，因为各种各样的现实世界问题自然都是多标签的。然而，发现每个类别的样本数量之间的差异并不少见，这对学习算法构成了额外的挑战。 Smote是一种过采样技术，已成功应用于平衡单标签数据集，但到目前为止尚未在多标签框架中使用。在这项工作中，提出并比较了几种策略，以生成用于平衡多标签算法训练中的数据集的合成样本。结果表明，正确选择种子样本进行过采样可以提高多标签算法的分类性能。统一的代过采样为广泛的现实问题提供了一种有效的方法。

著录项

来源
《Iberoamerican congress on pattern recognition》|2013年|334-342|共9页
会议地点
作者
Andres Felipe Giraldo-Forero; Jorge Alberto Jaramillo-Garzon; Jose Francisco Ruiz-Munoz; Cesar German Castellanos-Dominguez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. SMOTE-RSB_*: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory [J] . Enislay Ramentol, Yaile Caballero, Rafael Bello, Knowledge and information systems . 2012,第2期

机译：SMOTE-RSB_ *：使用SMOTE和粗糙集理论的基于过采样和欠采样的混合预处理方法，用于高不平衡数据集
2. SMOTE-RSB *: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory [J] . Enislay Ramentol, Yailé Caballero, Rafael Bello, Knowledge and Information Systems . 2012,第2期

机译：SMOTE-RSB * ：一种基于过采样和欠采样的混合预处理方法，使用SMOTE和粗糙集理论处理高不平衡数据集
3. ENHANCING THE PERFORMANCE OF SMOTE ALGORITHM BY USING ATTRIBUTE WEIGHTING SCHEME AND NEW SELECTIVE SAMPLING METHOD FOR IMBALANCED DATA SET [J] . TORA FAHRUDIN, JOKO LIANTO BULIALI, CHASTINE FATICHAH International Journal of Innovative Computing Information and Control . 2019,第2期

机译：使用属性加权方案和新的选择采样方法来增强Smote算法的性能
4. Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm [C] . Andres Felipe Giraldo-Forero, Jorge Alberto Jaramillo-Garzon, Jose Francisco Ruiz-Munoz, Iberoamerican Congress on Pattern Recognition . 2013

机译：管理多标签问题中的不平衡数据集：用粉碎算法进行案例研究
5. Online Classification Methods for Imbalance and Multi-Label Data [D] . Du, Jie. 2019

机译：不平衡和多标签数据的在线分类方法
6. Thick Slices from Tomosynthesis Data Sets: Phantom Study for the Evaluation of Different Algorithms [O] . Felix Diekmann, Henning Meyer, Susanne Diekmann, 2009

机译：层析合成数据集中的厚切片：用于评估不同算法的幻像研究
7. Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm [O] . Andrés Felipe Giraldo-Forero, Jorge Alberto Jaramillo-Garzón, José Francisco Ruiz-Muñoz, 2013

机译：管理多标签问题中的不平衡数据集：用粉碎算法进行案例研究

Managing Imbalanced Data Sets in Multi-label Problems: A Case Study with the SMOTE Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅