Research on an Intelligent Parallel Clustering Method for Big Data in Virtual Environments

屈洁

Abstract

To reduce the running time of big data in a virtual environment, and so that the data reflect certain regularities and distinct categories at run time, big data in virtual environments needs intelligent parallel clustering. Current big data clustering methods repeatedly adjust the classification of big data samples with K-means clustering and reach a parallel clustering result only after many rounds of computation and adjustment. Whenever new big data flows in, K-means clustering must be rerun on all current data, so the computation is complex and clustering efficiency is low. To address this, a MapReduce-based intelligent parallel clustering method for big data in virtual environments is proposed. First, a small data set is sampled from the big data in the virtual environment and the centroids of the big data clusters are determined. The Single method is used to cluster the sampled small data set, which yields the means of the big data attributes in the virtual environment, and the minimum-distance classification rule is used to move these attribute means quickly toward the true centers of the data clusters. Based on the Davies-Bouldin index, a cluster dispersion parameter is assumed, and the maximum clustering similarity for intelligent parallel clustering is selected among the values of this parameter. Finally, the maximum clustering similarity is used to obtain the Davies-Bouldin index. On the basis of this index, classes are merged into one class according to specified thresholds on centroid distance and cluster dispersion, and the computation is iterated to obtain the best cluster center positions, which completes the intelligent parallel clustering of big data in the virtual environment. Simulation results show that the proposed method improves the flexibility and general applicability of intelligent parallel clustering of big data and reduces clustering time. It is suited to the field of educational technology, where it can make educational technology network data both more rational and more standardized.
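The steps named in the abstract can be illustrated with a small single-machine sketch. This is not the author's implementation: it assumes the "Single method" means single-linkage agglomerative clustering, it uses plain Python functions (`map_assign`, `reduce_means`, both hypothetical names) in place of a real MapReduce job, and it uses the standard definition of the Davies-Bouldin index. The threshold-based merging step is left out because the abstract does not give its parameters.

```python
import math
import random
from collections import defaultdict


def dist(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean(points):
    """Component-wise mean of a non-empty list of tuples."""
    return tuple(sum(c) / len(points) for c in zip(*points))


def single_linkage(sample, k):
    """Assumed reading of the 'Single method': merge the two clusters with
    the smallest closest-pair distance until only k clusters remain."""
    clusters = [[p] for p in sample]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(dist(a, b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i].extend(clusters.pop(j))
    return clusters


def map_assign(chunk, centroids):
    """Map step (sketch): emit (index of nearest centroid, point),
    i.e. the minimum-distance classification rule."""
    for p in chunk:
        yield min(range(len(centroids)), key=lambda c: dist(p, centroids[c])), p


def reduce_means(pairs, centroids):
    """Reduce step (sketch): each centroid becomes the mean of its points.
    A centroid with no assigned points is kept unchanged."""
    groups = defaultdict(list)
    for idx, p in pairs:
        groups[idx].append(p)
    new = [mean(groups[i]) if groups[i] else centroids[i]
           for i in range(len(centroids))]
    return new, groups


def cluster(data, k, sample_size, chunk_size=1000, iters=10, seed=0):
    """Seed centroids from a small sample, then refine them on all data."""
    rng = random.Random(seed)
    sample = rng.sample(data, min(sample_size, len(data)))
    centroids = [mean(c) for c in single_linkage(sample, k)]
    groups = {}
    for _ in range(iters):
        pairs = []
        for start in range(0, len(data), chunk_size):  # one chunk per "mapper"
            pairs.extend(map_assign(data[start:start + chunk_size], centroids))
        new_centroids, groups = reduce_means(pairs, centroids)
        if new_centroids == centroids:  # assignments no longer change
            break
        centroids = new_centroids
    return centroids, groups


def davies_bouldin(centroids, groups):
    """Standard Davies-Bouldin index over the non-empty clusters.
    Assumes at least two non-empty clusters with distinct centroids."""
    live = [i for i in range(len(centroids)) if groups.get(i)]
    if len(live) < 2:
        raise ValueError("need at least two non-empty clusters")
    scatter = {i: sum(dist(p, centroids[i]) for p in groups[i]) / len(groups[i])
               for i in live}
    worst = [max((scatter[i] + scatter[j]) / dist(centroids[i], centroids[j])
                 for j in live if j != i) for i in live]
    return sum(worst) / len(live)
```

Calling `cluster(data, k, sample_size)` on a list of numeric tuples returns the refined centroids and the point groups, and `davies_bouldin(centroids, groups)` scores the result; lower values indicate more compact, better separated clusters. Because the expensive single-linkage pass runs only on the sample, newly arriving data only needs the cheap map and reduce passes rather than a full re-clustering.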

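Bibliographic details

Source
《计算机测量与控制》 (Computer Measurement & Control) | 2017, No. 6 | pp. 257-260 | 4 pages
Author
屈洁
Affiliation
School of Education, Baoji University of Arts and Sciences (宝鸡文理学院教育学院), Baoji, Shaanxi 721016
Original format: PDF
Language of text: Chinese
CLC classification: Program design, software engineering
Keywords
Virtual environment; big data; intelligent parallel; clustering method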
