首页> 外文期刊>Complexity >Analysis and Prediction of CET4 Scores Based on Data Mining Algorithm
【24h】

Analysis and Prediction of CET4 Scores Based on Data Mining Algorithm

机译:基于数据挖掘算法的CET4分数分析与预测

获取原文
       

摘要

This paper presents the concept and algorithm of data mining and focuses on the linear regression algorithm. Based on the multiple linear regression algorithm, many factors affecting CET4 are analyzed. Ideas based on data mining, collecting history data and appropriate to transform, using statistical analysis techniques to the many factors influencing the CET-4 test were analyzed, and we have obtained the CET-4 test result and its influencing factors. It was found that the linear regression relationship between the degrees of fit was relatively high. We further improve the algorithm and establish a partition-weighted K -nearest neighbor algorithm. The K-weighted K nearest neighbor algorithm and the partition algorithm are used in the CET-4 test score classification prediction, and the statistical method is used to study the relevant factors that affect the CET-4 test score, and screen classification is performed to predict when the comparison verification will pass. The weight K of the input feature and the adjacent feature are weighted, although the allocation algorithm of the adjacent classification effect has not been significantly improved, but the stability classification is better than K -nearest neighbor algorithm, its classification efficiency is greatly improved, classification time is greatly reduced, and classification efficiency is increased by 119%. In order to detect potential risk graduating students earlier, this paper proposes an appropriate and timely early warning and preschool K-nearest neighbor algorithm classification model. Taking test scores or make-up exams and re-learning as input features, the classification model can effectively predict ordinary students who have not graduated.
机译:本文介绍了数据挖掘的概念和算法,侧重于线性回归算法。基于多元线性回归算法,分析了影响CET4的许多因素。分析了基于数据挖掘,收集历史数据和适当转换的想法,使用统计分析技术对影响CET-4测试的许多因素进行分析,并获得了CET-4测试结果及其影响因素。发现合适程度之间的线性回归关系相对较高。我们进一步提高了算法并建立了分区加权k-nearest邻居算法。在CET-4测试得分分类预测中使用K加权k最近邻算法和分区算法,并且统计方法用于研究影响CET-4测试评分的相关因素,并对屏幕分类进行筛选预测比较验证将通过。输入特征和相邻特征的重量k加权,尽管相邻分类效果的分配算法尚未显着提高,但稳定性分类优于K-Nearest邻居算法,其分类效率大大提高,分类时间大大降低,分类效率提高了119%。为了检测潜在的风险毕业生毕业生,本文提出了适当及及时的预警和学龄前的k最近邻居算法分类模型。考试成绩或化妆考试并重新学习作为输入特征,分类模型可以有效地预测未毕业的普通学生。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号