首页> 外文会议>International Conference on Electrical, Communication and Computer Engineering >Importance of Data Re-Sampling and Dimensionality Reduction in Predicting Students' Success
【24h】

Importance of Data Re-Sampling and Dimensionality Reduction in Predicting Students' Success

机译:预测学生成功的数据重新采样和维度减少的重要性

获取原文

摘要

In this paper, we present the importance of data pre-processing in predicting students' success. We implemented Principal Component Analysis for dimensionality reduction to achieve better model performance. Data re-sampling technique was also utilized to handle the imbalanced class problem that is one of the significant issues in effective classification in Educational Data Mining due to the nature of the data from educational settings. We also performed a comparative analysis on the impacts of Random Under-Sampling (RUS), Random Over-Sampling (ROS), and Synthetic Minority Over-Sampling Technique (SMOTE) to an imbalanced dataset used in this study. SMOTE and PCA techniques application offer better performance compared to RUS and ROS with PCA. Support Vector Machine had the best accuracy value of 0.94 after the application of SMOTE and PCA. The application of PCA on the imbalanced data also positively affected the accuracy of the models used in this study. We used other performance metrics to evaluate our models: Kappa, Area Under Curve, and Precision-Recall curve. Our finding shows that the predictive models can predict student success with the proper application of PCA and data re-samnling techniques.
机译:在本文中,我们展示了数据预处理预测学生成功的重要性。我们实施了主要成分分析,以实现更好的模型性能。数据重新采样技术还用于处理不平衡的类问题,这是教育数据挖掘的有效分类中的重要问题之一,由于来自教育环境的数据的性质。我们还对本研究中使用的不平衡数据集进行了对随机欠抽样(RUS),随机上采样(ROS),随机上采样(ROS)和合成少数群体过采样技术(SMOTE)的影响的比较分析。与PCA的RUS和ROS相比,SMOTE和PCA技术应用程序提供更好的性能。 Smote和PCA在应用后,支持向量机具有0.94的最佳精度值。 PCA在不平衡数据上的应用也积极影响了本研究中使用的模型的准确性。我们使用其他性能指标来评估我们的模型:kappa,曲线下的区域和精密召回曲线。我们的发现表明,预测模型可以预测学生成功,正确应用PCA和数据重新素描技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号