首页> 外文会议>International Conference on Advance in Ambient Computing and Intelligence >Hadoop-Based University Ideological and Political Big Data Platform Design and Behavior Pattern Mining
【24h】

Hadoop-Based University Ideological and Political Big Data Platform Design and Behavior Pattern Mining

机译:地Hadoop的大学思想政治大数据平台设计与行为模式挖掘

获取原文

摘要

The combination of higher education and big data technology is not only the focus of the application of big data technology, but also an emerging field of self-development in the field of higher education. This article is dedicated to building a big data processing platform through Hadoop big data storage architecture, Hive, flume data collection technology, and Sqoop data synchronization technology to achieve efficient processing of big data sets. The traditional data mining algorithm is implemented using Map Reduce programming, and the implementation of the data mining algorithm of the Hadoop platform is studied, mainly to analyze its execution efficiency and scalability. We select the data clustering task in data mining as a representative, and write its Map Reduce version to test and verify its effect on the Hadoop platform. Through comparative experiments of different cluster sizes and different data sizes, it is concluded that the use of Hadoop distributed systems for data mining tasks has a good acceleration ratio and efficiency, and the extended performance analysis of computing power also shows that it has great potential.
机译:高等教育和大数据技术的组合不仅是大数据技术应用的重点,而且是高等教育领域的新兴自我发展领域。本文致力于通过Hadoop大数据存储体系结构,Hive,Flume数据收集技术和SQOPOM数据同步技术构建大数据处理平台,以实现大数据集的高效处理。使用地图实现传统的数据挖掘算法,缩小编程,研究了Hadoop平台的数据挖掘算法的实现,主要是分析其执行效率和可扩展性。我们选择数据挖掘中的数据聚类任务作为代表性,并写下其映射减少版本以测试并验证其对Hadoop平台的影响。通过不同聚类尺寸和不同数据尺寸的比较实验,得出结论,使用Hadoop分布式系统进行数据挖掘任务具有良好的加速度和效率,并且计算功率的扩展性能分析也表明它具有很大的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号