首页> 外文期刊>Vietnam Journal of Computer Science >Making kernel-based vector quantization robust and effective for incomplete educational data clustering
【24h】

Making kernel-based vector quantization robust and effective for incomplete educational data clustering

机译:使基于核的矢量量化稳健而有效地用于不完整的教育数据聚类

获取原文
获取外文期刊封面目录资料

摘要

Abstract Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the groups of similar students by several aspects such as study performance, behavior, skill, etc. Many recent educational data clustering works used the existing algorithms like k -means, expectation–maximization, spectral clustering, etc. Nevertheless, none of them considered the incompleteness of the educational data gathered in an academic credit system although incomplete data handling was figured out well with several different general-purpose solutions. Unfortunately, early in-trouble student detection normally faces data incompleteness as we have collected and processed the study results of the second-, third-, and fourth-year students who have not yet accomplished the program as of that moment. In this situation, the clustering task becomes an inevitable incomplete educational data clustering task. Hence, our work focuses on an incomplete educational data clustering approach to the aforementioned task. Following kernel-based vector quantization, we define a robust effective simple solution, named VQ_fk_nps, which is able to not only handle ubiquitous data incompleteness in an iterative manner using the nearest prototype strategy but also optimize the clusters in the feature space to reach the resulting clusters with arbitrary shapes in the data space. As shown through the experimental results on real educational data sets, the clusters from our solution have better cluster quality as compared to some existing approaches.
机译:摘要如今,从教育数据集中发现的知识在教育决策支持中起着重要作用。一种使我们能够洞悉学生特征的知识是由聚类任务生成的聚类模型。每个聚类模型通过几个方面(如学习成绩,行为,技能等)展示相似的学生群体。许多近期的教育数据聚类工作都使用了现有的算法,例如k均值,期望最大化,光谱聚类等。他们中的一些人认为在学分制中收集的教育数据是不完整的,尽管可以通过几种不同的通用解决方案很好地解决了不完整的数据处理问题。不幸的是,由于我们已经收集并处理了当时尚未完成该计划的第二,第三和第四年级学生的学习结果,因此,早期出现问题的学生检测通常会遇到数据不完整的情况。在这种情况下,聚类任务成为不可避免的不完整的教育数据聚类任务。因此,我们的工作重点是针对上述任务的不完整的教育数据聚类方法。在基于内核的矢量量化之后,我们定义了一个强大的有效简单解决方案,名为VQ_fk_nps,该解决方案不仅可以使用最接近的原型策略以迭代方式处理普遍存在的数据不完整性,还可以优化特征空间中的聚类以达到结果在数据空间中具有任意形状的聚类。如在真实教育数据集上的实验结果所示,与某些现有方法相比,我们解决方案中的群集具有更好的群集质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号