首页> 外文会议>Proceedings of the 2010 10th International Conference on Intelligent Systems Design and Applications >A preliminary study on overlapping and data fracture in imbalanced domains by means of Genetic Programming-based feature extraction
【24h】

A preliminary study on overlapping and data fracture in imbalanced domains by means of Genetic Programming-based feature extraction

机译:基于遗传编程的特征提取方法在不平衡域重叠和数据断裂的初步研究

获取原文

摘要

The classification of imbalanced data is a well-studied topic in data mining. However, there is still a lack of understanding of the factors that make the problem difficult. In this work, we study the two main reasons that make the classification of imbalanced datasets complex: overlapping and data fracture. We present a Genetic Programming-based feature extraction method driven by Rough Set Theory to help visualize the data in a bidimensional graph, to better understand how the presence of overlapping and data fractures affect classification performance.
机译:不平衡数据的分类是数据挖掘中经过充分研究的主题。但是,仍然缺乏使问题变得困难的因素的理解。在这项工作中,我们研究了使不平衡数据集的分类变得复杂的两个主要原因:重叠和数据断裂。我们提出一种由粗糙集理论驱动的基于遗传编程的特征提取方法,以帮助可视化二维图中的数据,以更好地了解重叠和数据断裂的存在如何影响分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号