基于 Dpark 的数据分析方法的性能研究磁

马燕龙; 吴云

首页> 中文期刊> 《计算机与数字工程》 >基于 Dpark 的数据分析方法的性能研究磁

基于 Dpark 的数据分析方法的性能研究磁

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Distributed computing has got extensive application with the coming of the big data era .Open source distrib‐uted computing frameworks headed by Hadoop and Spark lead the relevant industry standards .However ,there are difficul‐ties in using and second‐round developing Hadoop and Spark ,while the former is programmed with Java and the latter is pro‐grammed with Scala .But Dpark ,a distributed computing framework programmed with Python ,extremely improves work ef‐ficiency of data analysis ,because it not only inherits the mechanism of memory calculation and lazy evaluation from Spark , but also combines with the concise syntax of Python .What's more ,it is able to cooperate with MooseFS ,which is a distribu‐ted file system ,Beansdb ,which is a distributed database ,and Mesos ,which is a distributed resources scheduling frame‐work ,naturally .The work efficiency of traditional Python program and the Dpark‐based program in data preprocessing are compared ,while the performance and scalability of the latter is better than the former .%随着大数据时代的来临，以 Hadoop 和 Spark 为首的开源分布式计算框架主导着相关行业的事实标准。然而，无论是使用 Java 编写的 Hadoop ，还是使用 Scala 编写的 Spark ，使用及对其进行二次开发的难度都比较大，而使用 Py‐thon 编写的分布式计算框架 Dpark ，具有继承自 Spark 的内存计算和惰性求值机制，结合 Python 的简洁语法，同时又配合分布式文件系统 MooseFS 、分布式数据库 Beansdb 和分布式资源调度框架 Mesos ，可以极大提高数据分析的工作效率。文章主要对比了传统 Python 程序和基于 Dpark 的 Python 程序在完成数据预处理工作上的运行效率，得出后者的性能和可扩展性至少优于前者数十倍的结论。

著录项

来源
《计算机与数字工程》 |2016年第4期|691-693771|共4页
作者
马燕龙; 吴云;
展开▼
作者单位

贵州大学计算机科学与技术学院贵阳 550025;

贵州大学计算机科学与技术学院贵阳 550025;

展开▼
原文格式 PDF
正文语种 chi
中图分类软件工程;
关键词
Dpark 框架; 集群部署; 数据预处理;

相似文献

中文文献
外文文献
专利

1. 基于磁控磨料定向的SiC固相芬顿反应研抛盘制备及性能研究 [J] . 路家斌 ,曾帅 ,阎秋生 . 表面技术 . 2021,第10期
2. 基于复杂网络的磁流变橡胶磁致压缩力学性能研究 [J] . 柳彬 ,游世辉 ,赵树勋 . 铁道科学与工程学报 . 2019,第004期
3. 基于人工蜂群优化算法的自励磁发电机性能评估研究 [J] . 黄鹏翔 ,范磊 ,张孝 . 陕西电力 . 2018,第005期
4. 基于人工蜂群优化算法的自励磁发电机性能评估研究 [J] . 黄鹏翔1 ,范磊2 ,张孝2 . 智慧电力 . 2018,第005期
5. 基于琼脂糖的磁响应型光子晶体水凝胶的制备及性能研究 [J] . 张文莹 ,蒋文凤 ,卢学刚 . 光散射学报 . 2017,第004期
6. 基于感应励磁的混合励磁同步发电机性能 [C] . Zhu Changqing ,朱常青 ,Wang Xiuhe . 第七届电工技术前沿问题学术论坛 . 2016
7. 基于赋权评分和Dpark的分布式推荐系统研究与实现 [A] . 郭敬泽 . 2015

基于 Dpark 的数据分析方法的性能研究磁

摘要

著录项

相似文献

相关主题

期刊订阅