RSKC: An R Package for a Robust and Sparse K-Means Clustering Algorithm

Yumi Kondo; Matias Salibian-Barrera; Ruben Zamar

Abstract

Witten and Tibshirani (2010) proposed an algorithm, called sparse K-means (SK-means), that simultaneously finds clusters and selects clustering variables. SK-means is particularly useful when the dataset has a large fraction of noise variables (that is, variables without useful information to separate the clusters). SK-means works very well on clean and complete data but cannot handle outliers or missing data. To remedy these problems, we introduce a new robust and sparse K-means clustering algorithm (RSK-means), implemented in the R package RSKC. We demonstrate the use of our package on four datasets. We also conduct a Monte Carlo study to compare the performance of RSK-means and SK-means regarding the selection of important variables and the identification of clusters. Our simulation study shows that RSK-means performs well on clean data and better than SK-means and other competitors on outlier-contaminated data.
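
The variable selection in SK-means can be illustrated with the feature-weight update described by Witten and Tibshirani (2010): given a between-cluster sum of squares for each variable, the weights are a soft-thresholded, L2-normalized copy of those values, with the threshold chosen so that the L1 norm of the weights stays within a bound. The sketch below is plain Python and is not code from the RSKC package. The function names, the toy input values, and the bisection search are assumptions made for illustration, and the sketch covers only the sparsity step, not the handling of outliers or missing data.

```python
import math


def soft_threshold(values, delta):
    """Shrink each value toward zero by delta, clipping at zero."""
    return [max(v - delta, 0.0) for v in values]


def l2_normalize(values):
    """Scale a vector to unit L2 norm (an all-zero vector is returned as-is)."""
    norm = math.sqrt(sum(v * v for v in values))
    return [v / norm for v in values] if norm > 0 else list(values)


def sparse_weights(bcss, l1_bound, iters=60):
    """Weights maximizing sum(w_j * bcss_j) subject to
    ||w||_2 <= 1, ||w||_1 <= l1_bound and w_j >= 0.

    bcss     : per-variable between-cluster sums of squares (hypothetical input)
    l1_bound : sparsity bound, assumed to lie between 1 and sqrt(len(bcss))
    """
    a = [max(v, 0.0) for v in bcss]
    w = l2_normalize(a)
    if sum(w) <= l1_bound:
        return w  # the L1 constraint is inactive, so no thresholding is needed
    # Bisection on the threshold: a larger threshold gives a smaller L1 norm.
    lo, hi = 0.0, max(a)
    for _ in range(iters):
        mid = (lo + hi) / 2
        w = l2_normalize(soft_threshold(a, mid))
        if sum(w) > l1_bound:
            lo = mid
        else:
            hi = mid
    return l2_normalize(soft_threshold(a, hi))


# Toy example: two informative variables followed by three noise variables.
toy_bcss = [9.0, 8.0, 0.3, 0.2, 0.1]
toy_weights = sparse_weights(toy_bcss, l1_bound=1.3)
print([round(w, 3) for w in toy_weights])
```

With a tight bound, the variables with a small between-cluster sum of squares receive a weight of exactly zero, which is how noise variables drop out of the clustering.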

Bibliographic record

Source: Journal of Statistical Software, 2016, Issue 1, 26 pages
Authors: Yumi Kondo; Matias Salibian-Barrera; Ruben Zamar
Original format: PDF
CLC classification: Computing technology, computer technology
