Method of under-sampling based ensemble for data imbalance problem

Abstract

The present invention relates to an undersampling-based ensemble method for resolving data imbalance. The method comprises the steps of: dividing a large corporate insolvency dataset into a majority class (normal companies) and a minority class (insolvent companies); forming a set of subsets by undersampling; measuring the similarity between the population data and the data of each subset, in order to reduce the information loss of the subsets relative to the population; training a base learner on each subset and constructing an ensemble; and evaluating the performance of each classifier on a held-out test set and measuring the statistical significance of the performance differences.
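The steps in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented procedure: the abstract does not name the base learner, the similarity measure, the number of subsets, or the evaluation metric, so the nearest-centroid learner, the centroid-distance proxy `mean_shift`, the choice of 9 subsets, balanced accuracy, and the synthetic two-feature data are all assumptions made for the example.

```python
import random
from statistics import mean

def make_data(n_major, n_minor, rng):
    """Synthetic 2-feature data: label 0 = normal (majority), 1 = insolvent (minority)."""
    major = [([rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)], 0) for _ in range(n_major)]
    minor = [([rng.gauss(2.5, 1.0), rng.gauss(2.5, 1.0)], 1) for _ in range(n_minor)]
    return major, minor

def centroid(rows):
    """Per-feature mean of a list of (features, label) rows."""
    return [mean(col) for col in zip(*(x for x, _ in rows))]

def dist(a, b):
    """Euclidean distance between two feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) ** 0.5

def undersample(major, minor, n_subsets, rng):
    """Each subset = a random majority sample (same size as the minority) + all minority rows.
    Subsets are drawn independently, so they may share majority rows."""
    return [rng.sample(major, len(minor)) + minor for _ in range(n_subsets)]

def mean_shift(major, subset):
    """Assumed similarity proxy: distance between the majority-class centroid of the
    population and that of the subset (0 means identical means)."""
    sub_major = [r for r in subset if r[1] == 0]
    return dist(centroid(major), centroid(sub_major))

def fit_nearest_centroid(rows):
    """Assumed base learner: store one centroid per class."""
    return {c: centroid([r for r in rows if r[1] == c]) for c in (0, 1)}

def predict_one(model, x):
    """Predict the class whose centroid is closest to x."""
    return min(model, key=lambda c: dist(model[c], x))

def predict_ensemble(models, x):
    """Majority vote over the base learners (use an odd number to avoid ties)."""
    votes = sum(predict_one(m, x) for m in models)
    return 1 if 2 * votes > len(models) else 0

def balanced_accuracy(predict, rows):
    """Mean of per-class recall, so the majority class does not dominate the score."""
    recalls = []
    for c in (0, 1):
        cls = [x for x, y in rows if y == c]
        recalls.append(sum(predict(x) == c for x in cls) / len(cls))
    return mean(recalls)

rng = random.Random(0)
train_major, train_minor = make_data(1000, 50, rng)
test_major, test_minor = make_data(400, 20, rng)
test_rows = test_major + test_minor

subsets = undersample(train_major, train_minor, n_subsets=9, rng=rng)
shifts = [mean_shift(train_major, s) for s in subsets]
models = [fit_nearest_centroid(s) for s in subsets]

single_scores = [balanced_accuracy(lambda x, m=m: predict_one(m, x), test_rows) for m in models]
ensemble_score = balanced_accuracy(lambda x: predict_ensemble(models, x), test_rows)
print("mean shift per subset:", [round(s, 3) for s in shifts])
print("single learners:", [round(s, 3) for s in single_scores])
print("ensemble:", round(ensemble_score, 3))
```

The final step of the abstract, testing whether the performance differences are statistically significant, is not shown here; one possible approach would be a paired test over the per-learner scores collected across repeated train/test splits.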

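Bibliographic data

Publication number: KR20200113397A
Patent type:
Publication date: 2020-10-07
Original format: PDF
Applicant/patentee: 동서대학교 산학협력단
Application number: KR20190033526
Inventor: 강대기
Filing date: 2019-03-25
Classification: G06N20/20; G06F16/906
Country: KR
Date indexed: 2022-08-21 11:05:52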
