...
首页> 外文期刊>International Journal of Advances in Soft Computing and Its Applications >Dimensionality Reduction for Predicting Student Performance in Unbalanced Data Sets
【24h】

Dimensionality Reduction for Predicting Student Performance in Unbalanced Data Sets

机译:降维可预测不平衡数据集中的学生表现

获取原文

摘要

In this study, we evaluated two data sets from two Portuguese schools for predicting student performance. These data sets contain not only the previous grades of the students, but also the demographic, social and school related features. Both data sets are unbalanced in class distribution and contained some irrelevant features. Such characteristics may cause unsatisfactory True Positive (TP) rates for the Fail grade. This grade is important in prediction but it has a low representation as compared with the Pass grade. To improve prediction, dimensionality reduction was performed on both data sets to generate subsets that contained: (i) features selected by a wrapper approach, and (ii) only previous grade(s). The results showed that dimensionality reduction helped to improve the TP rates for the Fail grade. In addition, good classification accuracies were attained. We also noticed that even though the subsets contain only one previous grade, comparable accuracies can also be achieved.
机译:在这项研究中,我们评估了来自两所葡萄牙学校的两个数据集,以预测学生的表现。这些数据集不仅包含学生以前的成绩,还包含人口统计学,社会和学校相关的特征。这两个数据集的类分布不平衡,并且包含一些不相关的功能。这样的特性可能会导致不合格等级的真实正(TP)率不令人满意。该成绩对预测很重要,但与及格成绩相比,它的代表性低。为了改善预测,对两个数据集都进行了降维,以生成包含以下内容的子集:(i)通过包装方法选择的特征,以及(ii)仅以前的等级。结果表明降维有助于提高不合格品级的TP率。另外,获得了良好的分类精度。我们还注意到,即使子集仅包含一个以前的等级,也可以实现可比较的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号