首页> 外文会议>International Conference on Web Information Systems Engineering >A Dynamic Decision-Making Method Based on Ensemble Methods for Complex Unbalanced Data
【24h】

A Dynamic Decision-Making Method Based on Ensemble Methods for Complex Unbalanced Data

机译:一种基于集合方法的组合方法的动态决策方法

获取原文

摘要

Class imbalance has been proven to seriously hinder the precision of many standard learning algorithms. To solve this problem, a number of methods have been proposed, for example, the distance-based balancing ensemble method that learns the unbalanced dataset by converting it into multiple balanced subsets on which sub-classifiers are built. However, the class-imbalance problem is usually accompanied by other data-complexity problems such as class overlap, small disjuncts, and noise instance. Current algorithms developed for primary unbalanced-data problems cannot address the complex-data problems at the same time. Some of these algorithms even exacerbate the class-overlap and small-disjuncts problems after trying to address the complex-data problem. On this account, this study proposes a dynamic ensemble selection decision-making (DESD) method. The DESD first repeats the random-splitting technique to divide the dataset into multiple balanced subsets that contain no or few class-overlap and small-disjunct problems. Then, the classifiers are built on these subsets to compose the candidate classifier pool. To select the most appropriate classifiers from the candidate classifier pool for the classification of each query instance, we use a weighting mechanism to highlight the competence of classifiers that are more powerful in classifying minority instances belonging to the local region in which the query instance is located. Tests with 15 standard datasets from public repositories are performed to demonstrate the effectiveness of the DESD method. The results show that the precision of the DESD method outperforms other ensemble methods.
机译:阶级不平衡已被证明严重阻碍了许多标准学习算法的精度。为了解决这个问题,已经提出了许多方法,例如,通过将其转换为构建子分类器的多个平衡子集来学习不平衡数据集的基于距离的平衡集合方法。但是,类不平衡问题通常伴随着其他数据复杂性问题,例如类重叠,小分数和噪声实例。为主要不平衡数据问题开发的当前算法不能同时解决复杂数据问题。在尝试解决复杂数据问题之后,其中一些算法甚至加剧了类重叠和小分数问题。在此帐户中,本研究提出了一种动态集合选择决策(DESD)方法。 DESD首先重复随机分离技术将数据集分成多个平衡子集,该子集包含不或几类重叠和小分离问题。然后,基于这些子集内置的分类器以撰写候选分类器池。要选择来自候选分类器池中的最合适的分类器,以获取每个查询实例的分类,我们使用加权机制来突出显示在分类属于Query实例所在的本地区域的少数群体实例中更强大的分类器的能力。执行具有公共存储库的15个标准数据集的测试以证明DESD方法的有效性。结果表明,DESD方法的精度优于其他集合方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号