...
首页> 外文期刊>Expert Systems >Resampling with neighbourhood bias on imbalanced domains
【24h】

Resampling with neighbourhood bias on imbalanced domains

机译:在不平衡域上使用邻域偏差重采样

获取原文
获取原文并翻译 | 示例
           

摘要

Imbalanced domains are an important problem that arises in predictive tasks causing a loss in the performance on the most relevant cases for the user. This problem has been extensively studied for classification problems, where the target variable is nominal. Recently, it was recognized that imbalanced domains occur in several other contexts and for multiple tasks, such as regression tasks, where the target variable is continuous. This paper focuses on imbalanced domains in both classification and regression tasks. Resampling strategies are among the most successful approaches to address imbalanced domains. In this work, we propose variants of existing resampling strategies that are able to take into account the information regarding the neighbourhood of the examples. Instead of performing sampling uniformly, our proposals bias the strategies to reinforce some regions of the data sets. With an extensive set of experiments, we provide evidence of the advantage of introducing a neighbourhood bias in the resampling strategies for both classification and regression tasks with imbalanced data sets.
机译:域的不平衡是在预测任务中出现的重要问题,导致在与用户最相关的情况下性能下降。对于目标变量为名义变量的分类问题,已经对该问题进行了广泛研究。最近,人们认识到不平衡域出现在其他几种情况下,并且对于多个任务(例如回归任务),其中目标变量是连续的。本文着重于分类和回归任务中的不平衡域。重采样策略是解决不平衡域的最成功方法之一。在这项工作中,我们提出了现有重采样策略的变体,这些变体能够考虑有关示例邻域的信息。我们的建议并非统一执行抽样,而是偏重于增强数据集某些区域的策略。通过大量的实验,我们提供了在不平衡数据集的分类和回归任务的重采样策略中引入邻域偏差的优势的证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号