首页> 外文会议>International Conference on Vocational Education and Electrical Engineering >Combining the Unsupervised Discretization Method and the Statistical Machine Learning on the Students’ Performance
【24h】

Combining the Unsupervised Discretization Method and the Statistical Machine Learning on the Students’ Performance

机译:无监督离散化方法与统计机器学习相结合的学生表现

获取原文

摘要

the suitability of the data with the method in the process of data mining is very important to increase the process performance. However, In Educational Data Mining (EDM), not much research has focused on this field. Therefore, this study proposes to combine an unsupervised discretization method called "equal width interval" and logistic regression as statistical machine learning to improve the performance of the model relating to students' performance. The discretization method is performed on student data with several intervals, namely: 3-interval, 4-interval, and 5-interval. Then, these intervals are combined with logistic regression in two regularizations, namely: lasso and ridge. Evaluation is carried out on all combinations. The experimental results indicate that the highest performance, in terms of the accuracy level, is achieved by the model combining a 3-interval and logistic regression on all regularization. This combination can increase the model performance based on the average accuracy level of about 4.08-8.49 on the ridge regularization and about 4.28-8.6 on the lasso regularization.
机译:在数据挖掘过程中,该方法对数据的适用性对于提高过程性能非常重要。但是,在教育数据挖掘(EDM)中,对此领域的研究很少。因此,本研究建议将一种称为“等宽度区间”的无监督离散化方法与逻辑回归作为统计机器学习相结合,以改善与学生表现有关的模型的表现。离散化方法以几个间隔(即3个间隔,4个间隔和5个间隔)对学生数据执行。然后,将这些区间与逻辑回归结合成两个正则化,即套索和岭。对所有组合进行评估。实验结果表明,通过在所有正则化上结合了3间隔和logistic回归的模型,可以在准确性水平上实现最高的性能。这种组合可以基于岭正则化上的约4.08-8.49和套索正则化上的约4.28-8.6的平均准确度水平来提高模型性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号