Making kernel-based vector quantization robust and effective for incomplete educational data clustering

Thi Ngoc Chau Vo; Hua Phung Nguyen; Thi Ngoc Tran Vo

首页> 外文期刊>Vietnam Journal of Computer Science >Making kernel-based vector quantization robust and effective for incomplete educational data clustering

【24h】

Making kernel-based vector quantization robust and effective for incomplete educational data clustering

机译：使基于核的矢量量化稳健而有效地用于不完整的教育数据聚类

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Abstract Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the groups of similar students by several aspects such as study performance, behavior, skill, etc. Many recent educational data clustering works used the existing algorithms like k -means, expectation–maximization, spectral clustering, etc. Nevertheless, none of them considered the incompleteness of the educational data gathered in an academic credit system although incomplete data handling was figured out well with several different general-purpose solutions. Unfortunately, early in-trouble student detection normally faces data incompleteness as we have collected and processed the study results of the second-, third-, and fourth-year students who have not yet accomplished the program as of that moment. In this situation, the clustering task becomes an inevitable incomplete educational data clustering task. Hence, our work focuses on an incomplete educational data clustering approach to the aforementioned task. Following kernel-based vector quantization, we define a robust effective simple solution, named VQ_fk_nps, which is able to not only handle ubiquitous data incompleteness in an iterative manner using the nearest prototype strategy but also optimize the clusters in the feature space to reach the resulting clusters with arbitrary shapes in the data space. As shown through the experimental results on real educational data sets, the clusters from our solution have better cluster quality as compared to some existing approaches.

机译：摘要如今，从教育数据集中发现的知识在教育决策支持中起着重要作用。一种使我们能够洞悉学生特征的知识是由聚类任务生成的聚类模型。每个聚类模型通过几个方面（如学习成绩，行为，技能等）展示相似的学生群体。许多近期的教育数据聚类工作都使用了现有的算法，例如k均值，期望最大化，光谱聚类等。他们中的一些人认为在学分制中收集的教育数据是不完整的，尽管可以通过几种不同的通用解决方案很好地解决了不完整的数据处理问题。不幸的是，由于我们已经收集并处理了当时尚未完成该计划的第二，第三和第四年级学生的学习结果，因此，早期出现问题的学生检测通常会遇到数据不完整的情况。在这种情况下，聚类任务成为不可避免的不完整的教育数据聚类任务。因此，我们的工作重点是针对上述任务的不完整的教育数据聚类方法。在基于内核的矢量量化之后，我们定义了一个强大的有效简单解决方案，名为VQ_fk_nps，该解决方案不仅可以使用最接近的原型策略以迭代方式处理普遍存在的数据不完整性，还可以优化特征空间中的聚类以达到结果在数据空间中具有任意形状的聚类。如在真实教育数据集上的实验结果所示，与某些现有方法相比，我们解决方案中的群集具有更好的群集质量。

著录项

来源
《Vietnam Journal of Computer Science》 |2016年第2期|共10页
作者
Thi Ngoc Chau Vo; Hua Phung Nguyen; Thi Ngoc Tran Vo;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm [J] . DAO-QIANG ZHANG, SONG-CAN CHEN Neural processing letters . 2003,第3期

机译：基于核的模糊C均值算法对不完整数据进行聚类
2. Efficient approximations of robust soft learning vector quantization for non-vectorial data [J] . Daniela Hofmann, Andrej Gisbrecht, Barbara Hammer Neurocomputing . 2015,第jana5期

机译：非矢量数据的鲁棒软学习矢量量化的有效近似
3. The Fuzzy Learning Vector Quantization with Allied Fuzzy C-means Clustering for Clustering Noisy Data [J] . Xiaohong Wu, BinWu, Yong Deng, Journal of information and computational science . 2011,第9期

机译：关联模糊C均值聚类的模糊学习向量量化
4. A robust and effective algorithmic framework for incomplete educational data clustering [C] . Vo Thi Ngoc Chau, Nguyen Hua Phung, Vo Thi Ngoc Tran National Foundation for Science and Technology Development Conference on Information and Computer Science . 2015

机译：不完整教育数据聚类的强大有效的算法框架
5. Kernel-based clustering of big data. [D] . Chitta, Radha. 2015

机译：基于内核的大数据集群。
6. Vector Quantized Spectral Clustering Applied to Whole Genome Sequences of Plants [O] . Aditya A Shastri, Kapil Ahuja, Milind B Ratnaparkhe, 2019

机译：矢量量化谱聚类应用于植物全基因组序列
7. Making kernel-based vector quantization robust and effective for incomplete educational data clustering [O] . 2016

机译：使基于核的矢量量化稳健而有效地用于不完整的教育数据聚类

Making kernel-based vector quantization robust and effective for incomplete educational data clustering

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅