首页> 外文会议>IFIP TC8 International Conference on Computer Information Systems and Industrial Management >Rough Sets in Imbalanced Data Problem: Improving Re-sampling Process
【24h】

Rough Sets in Imbalanced Data Problem: Improving Re-sampling Process

机译:不平衡数据问题的粗糙集:改善重新采样过程

获取原文
获取外文期刊封面目录资料

摘要

Imbalanced data problem is still one of the most interesting and important research subjects. The latest experiments and detailed analysis revealed that not only the underrepresented classes are the main cause of performance loss in machine learning process, but also the inherent complex characteristics of data. The list of discovered significant difficulty factors consists of the phenomena like class overlapping, decomposition of the minority class, presence of noise and outliers. Although there are numerous solutions proposed, it is still unclear how to deal with all of these issues together and correctly evaluate the class distribution to select a proper treatment (especially considering the real-world applications where levels of uncertainty are eminently high). Since applying rough sets theory to the imbalanced data learning problem could be a promising research direction, the improved re-sampling approach combining selective preprocessing and editing techniques is introduced in this paper. The novel technique allows both qualitative and quantitative data handling.
机译:不平衡数据问题仍然是最有趣和最重要的研究科目之一。最新的实验和详细分析表明,不仅持代表性的课程不仅是机器学习过程中性能损失的主要原因,而且是数据的固有复杂特性。被发现的显着难度因素的列表包括类别的现象,如类重叠,分解少数阶级,噪音和异常值的存在。虽然有提出了许多解决方案,目前还不清楚如何处理所有这些问题一起,正确评价类别分布来选择适当的治疗(特别是考虑到现实世界的应用,其中的不确定性水平突出地高)。由于将粗糙集理论应用于不平衡的数据学习问题,因此本文介绍了组合选择性预处理和编辑技术的改进的再采样方法。新颖的技术允许定性和定量数据处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号