Kernelized Spectral Clustering based Conditional MapReduce function with big data

K. Maheswari; M. Ramakrishnan

首页> 外文期刊>International Journal of Computers & Applications >Kernelized Spectral Clustering based Conditional MapReduce function with big data

【24h】

Kernelized Spectral Clustering based Conditional MapReduce function with big data

机译：基于内核的基于条件MAPREDUCE功能，具有大数据

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Clustering is the significant data mining technique for big data analysis, where large volume data are grouped. The resulting of clustering is to minimize the dimensionality while accessing large volume of data. The several data mining techniques have been developed for clustering the data. But the problem of clustering becomes increasing rapidly in recent years since the existing clustering algorithm failed to minimize the clustering time and majority of techniques require huge memory to perform clustering task. In order to improve clustering accuracy and minimize the dimensionality, a Kernelized Spectral Clustering based Conditional Maximum Entropy MapReduce (KSC-CMEMR) technique is introduced. The number of data is collected from big dataset. The KSC-CMEMR technique partitions the data into different clusters using Kernelized Spectral Clustering Process based on the spectrum of similarity matrix and to perform dimensionality reduction. Based on the similarity, the Kernelized Spectral Clustering is carried out with higher clustering accuracy. After that, Conditional Maximum Entropy MapReduce model eliminates the irrelevant data present in the cluster. The designed model predicts the maximum probabilities of data become a member of the cluster and remove the irrelevant data from the cluster. This helps to reduce the false positive and space complexity. Experimental evaluation is carried out with certain parameters such as clustering accuracy, clustering time, false positive rate, and space complexity with respect to the number of data. The experimental results reported that the proposed KSC-CMEMR technique obtains high clustering accuracy with minimum time as well as space complexity.

机译：聚类是大数据分析的重要数据挖掘技术，其中大量数据分组。群集的结果是在访问大量数据时最小化维度。已经开发了几种数据挖掘技术来聚类数据。但是，由于现有的聚类算法未能最小化群集时间，因此大多数技术需要巨大内存来执行聚类任务，因此近年来群化的问题变得迅速增加。为了提高聚类精度并最小化维度，引入了基于核化的基于谱聚类的条件最大熵映射（KSC-CMEMR）技术。从大数据集收集数据数量。 KSC-CMEMR技术使用基于相似性矩阵的频谱的封闭光谱聚类处理将数据分配到不同的簇中，并执行维度减少。基于相似性，通过较高的聚类精度进行核化谱聚类。之后，条件最大熵MapReduce模型消除了集群中存在的无关数据。设计的模型预测数据的最大概率成为群集的成员，并从群集中删除无关数据。这有助于降低假的正面和空间复杂性。实验评估与某些参数进行，例如聚类准确性，聚类时间，假阳性率和相对于数据数量的空间复杂性。实验结果报告说，所提出的KSC-CMEMR技术具有最小时间和空间复杂度的高聚类精度。

著录项

来源
《International Journal of Computers & Applications》 |2021年第8期|601-611|共11页
作者
K. Maheswari; M. Ramakrishnan;
展开▼
作者单位

Department of Computer Science Bharathiar University Coimbatore India;

School of Information Technology Madurai Kamaraj University Madurai India;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Big data analytics; clustering; dimensionality reduction; Kernelized Spectral Clustering; Conditional Maximum Entropy MapReduce; irrelevant data;

机译：大数据分析;聚类;减少维度;封闭谱聚类;条件最大熵MapReduce;无关的数据;

相似文献

外文文献
中文文献
专利

1. Particle swarm optimization of kernel-based fuzzy c-means for hyperspectral data clustering [J] . Saeid Niazmardi, Amin Alizadeh Naeini, Saeid Homayouni, Journal of Applied Remote Sensing . 2012,第Null期

机译：基于核的模糊c均值粒子群算法在高光谱数据聚类中的应用
2. Spectral Clustering Algorithm Based on Improved Gaussian Kernel Function and Beetle Antennae Search with Damping Factor [J] . Zhe Zhang, Xiyu Liu, Lin Wang Computational intelligence and neuroscience . 2020,第4期

机译：基于改进的高斯内核函数和阻尼因子的甲虫天线搜索的光谱聚类算法
3. Parallel Spectral Clustering Based on MapReduce [J] . Qiwei Zhong, Yunlong Lin, Junyang Zou, 中兴通讯技术（英文版） . 2013,第002期

机译：基于MapReduce的并行谱聚类
4. Distributed Extreme Learning Machine with Kernels Based on Mapreduce for Spectral-Spatial Classification of Hyperspectral Image [C] . Junhua Ku, Bing Zheng IEEE International Conference on Computational Science and Engineering;IEEE/IFIP International Conference on Embedded and Ubiquitous Computing . 2017

机译：基于Mapreduce的带核分布式极限学习机用于高光谱图像的光谱空间分类
5. Kernel-based clustering of big data. [D] . Chitta, Radha. 2015

机译：基于内核的大数据集群。
6. Spectral Clustering Algorithm Based on Improved Gaussian Kernel Function and Beetle Antennae Search with Damping Factor [O] . Zhe Zhang, Xiyu Liu, Lin Wang 2020

机译：基于改进的高斯内核函数和阻尼因子的甲虫天线搜索的光谱聚类算法
7. Spectral Clustering Algorithm Based on Improved Gaussian Kernel Function and Beetle Antennae Search with Damping Factor [O] . Zhe Zhang, Xiyu Liu, Lin Wang 2020

机译：基于改进的高斯内核函数和阻尼因子的甲虫天线搜索的光谱聚类算法

Kernelized Spectral Clustering based Conditional MapReduce function with big data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅