A Novel on Altered K-Means Algorithm for Clustering Cost Decrease of Non-labeling Big-Data


Abstract

Machine learning for Big Data is attracting attention as a way to retrieve useful knowledge inherent in multi-dimensional information and to discover new knowledge in fields related to the storage and retrieval of massive, newly produced multi-dimensional information. Machine learning techniques can be divided into supervised and unsupervised learning according to whether the data are labeled. Unsupervised learning, which classifies and analyzes unlabeled data, is used in various ways in the analysis of multi-dimensional Big Data. The present study analyzes the problems of the conventional K-means algorithm and proposes an altered K-means algorithm that determines the number of clusters automatically. The study also proposes an approach that optimizes the number of clusters by applying principal component analysis (PCA) as a pre-processing step to the clustering input data. The performance evaluation results confirm that the proposed algorithm was superior to the conventional K-means algorithm in CVI accuracy.
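The abstract does not spell out how the altered algorithm selects the number of clusters, so the following is only a minimal Python sketch of the general idea it describes: reduce the input with PCA, run K-means for several candidate values of k, and keep the k with the best validity score. Using the silhouette coefficient as the CVI, the deterministic farthest-point initialisation, and all function names (`pca_project`, `kmeans`, `silhouette`, `choose_k`) are assumptions made for illustration, not the authors' method.

```python
import math
import random


def _matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def _unit(v):
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0.0 else list(v)


def pca_project(rows, n_components, iters=200, seed=0):
    """Project rows onto their top principal components (power iteration)."""
    rng = random.Random(seed)
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
           for a in range(d)]
    components = []
    for _comp in range(n_components):
        v = _unit([rng.gauss(0.0, 1.0) for _ in range(d)])
        for _step in range(iters):
            w = _matvec(cov, v)
            if not any(w):
                break  # no variance left to explain
            v = _unit(w)
        eigval = sum(x * y for x, y in zip(v, _matvec(cov, v)))
        components.append(v)
        # Deflate so the next pass finds the next component.
        cov = [[cov[a][b] - eigval * v[a] * v[b] for b in range(d)]
               for a in range(d)]
    return [[sum(x * y for x, y in zip(c, comp)) for comp in components]
            for c in centered]


def kmeans(points, k, iters=100):
    """Lloyd's K-means; farthest-point initialisation is an assumption."""
    centers = [list(points[0])]
    while len(centers) < k:
        far = max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        centers.append(list(far))
    labels = None
    for _step in range(iters):
        new_labels = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                      for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def silhouette(points, labels):
    """Mean silhouette coefficient, used here as a stand-in CVI."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        return -1.0
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # singletons contribute 0 by convention
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for key, other in clusters.items() if key != lab)
        if max(a, b) > 0.0:
            total += (b - a) / max(a, b)
    return total / len(points)


def choose_k(points, k_values):
    """Return the k with the highest silhouette score, plus all scores."""
    scores = {k: silhouette(points, kmeans(points, k)) for k in k_values}
    return max(scores, key=scores.get), scores
```

For example, `choose_k(pca_project(data, 2), range(2, 6))` returns the selected k together with the score of every candidate. The silhouette computation is quadratic in the number of points, so this sketch suits small samples only.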

Bibliographic details

Source
International conference on future information technology; International conference on multimedia and ubiquitous engineering | 2018 | pp. 375-381 | 7 pages
Conference location: Salerno (IT)
Authors
Se-Hoon Jung; Won-Ho So; Kang-Soo You; Chun-Bo Sim;
Author affiliations

Department of Multimedia Engineering Sunchon National University Suncheon Republic of Korea;

Department of Computer Education Sunchon National University Suncheon Republic of Korea;

School of Liberal Arts Jeonju University Jeonju Republic of Korea;

School of Information Communication and Multimedia Engineering Sunchon National University Suncheon Republic of Korea;

Original format: PDF
Keywords
Altered K-means; PCA; Big-Data; Multidimensional;

