首页> 外文会议>International Conference on Information and Communications Technology >Data level approach for imbalanced class handling on educational data mining multiclass classification
【24h】

Data level approach for imbalanced class handling on educational data mining multiclass classification

机译:教育数据挖掘多类分类中不平衡类处理的数据级方法

获取原文

摘要

In Educational Data Mining (EDM), researchers usually overlook the balance of the distribution on a dataset. It can seriously affect the result of the classification process. Theoretically, the majority of classifier assumed that the distribution of the data is relatively balanced. Hence, the performance of the classification algorithm just become less effective and need to be handled so the problem can be solved. This study will explain about imbalanced class on multiclass EDM dataset handling mechanism using the combination of SMOTE and OSS. SMOTE and OSS method provides balancing mechanism for the dataset's distribution, so that the classification results will be enhanced in terms of classification performance. The result shows that the combination of SMOTE and OSS can enhance the performance of SVM as the classification method that used in this study. Those combination of methods produce the accuracy, sensitivity, specificity, and g-mean score as high as 88.637%, 92.292%, 95.554%, 93.796% respectively. Hence, the SMOTE and OSS combination can be a viable solution for imbalanced class on EDM's multiclass dataset.
机译:在教育数据挖掘(EDM)中,研究人员通常会忽略数据集上分布的平衡。它会严重影响分类过程的结果。从理论上讲,大多数分类器都假设数据的分布是相对平衡的。因此,分类算法的性能只是变得不太有效,需要对其进行处理,从而可以解决该问题。这项研究将解释结合使用SMOTE和OSS的多类EDM数据集处理机制上的不平衡类。 SMOTE和OSS方法为数据集的分布提供了一种平衡机制,从而可以提高分类结果的分类性能。结果表明,SMOTE和OSS的组合可以增强SVM的性能,这是本研究中使用的分类方法。这些方法的组合产生的准确性,敏感性,特异性和g均值分别高达88.637%,92.292%,95.554%和93.796%。因此,对于EDM的多类数据集上的不平衡类,SMOTE和OSS组合可能是可行的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号