首页> 外文会议>Computer Information Systems and Industrial Management >Rough Sets in Imbalanced Data Problem: Improving Re-sampling Process
【24h】

Rough Sets in Imbalanced Data Problem: Improving Re-sampling Process

机译:不平衡数据中的粗糙集问题:改进重采样过程

获取原文
获取原文并翻译 | 示例

摘要

Imbalanced data problem is still one of the most interesting and important research subjects. The latest experiments and detailed analysis revealed that not only the underrepresented classes are the main cause of performance loss in machine learning process, but also the inherent complex characteristics of data. The list of discovered significant difficulty factors consists of the phenomena like class overlapping, decomposition of the minority class, presence of noise and outliers. Although there are numerous solutions proposed, it is still unclear how to deal with all of these issues together and correctly evaluate the class distribution to select a proper treatment (especially considering the real-world applications where levels of uncertainty are eminently high). Since applying rough sets theory to the imbalanced data learning problem could be a promising research direction, the improved re-sampling approach combining selective preprocessing and editing techniques is introduced in this paper. The novel technique allows both qualitative and quantitative data handling.
机译:数据不平衡问题仍然是最有趣和重要的研究课题之一。最新的实验和详细的分析表明,不仅代表不足的类是机器学习过程中性能下降的主要原因,而且还是数据固有的复杂特性。已发现的重大困难因素包括类别重叠,少数类别分解,噪声和异常值等现象。尽管提出了许多解决方案,但仍不清楚如何一起处理所有这些问题并正确评估类分布以选择适当的处理方法(尤其是考虑到不确定性水平非常高的实际应用)。由于将粗糙集理论应用于不平衡数据学习问题可能是一个有前途的研究方向,因此本文介绍了结合选择性预处理和编辑技术的改进的重采样方法。这项新技术可以进行定性和定量数据处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号