【24h】

Performance Research on MapReduce Programming Model

机译:MapReduce编程模型的性能研究

获取原文
获取原文并翻译 | 示例

摘要

Map Reduce programming model is designed to process large data sets in-parallel on large clusers. But most organizations can't afford to built a large cluster, so building a small cluster to improve the efficience of time-consuming applications is a perfect solution. Besides, non-data-intensive programs are common. Is Map Reduce suitable for this kind of programs? In this paper, a small cluster consisting of 5 PCs is built, with its configuration adjusted. Then a distributed FTP scan program is written to test whether Map Reduce is suitable for small data sets, network-intensive program. Finally a distributed string search program is written to test the performance of Map Reduce on large data sets. The results show that Map Reduce can run efficiently on small cluster, and it's also suitable for small data sets, network-I/O-intensive programs.
机译:Map Reduce编程模型设计用于在大型cluser上并行处理大型数据集。但是大多数组织负担不起构建大型集群,因此构建小型集群以提高耗时的应用程序的效率是一个完美的解决方案。此外,非数据密集型程序很常见。 Map Reduce是否适合此类程序?本文构建了一个由5台PC组成的小型集群,并对其配置进行了调整。然后编写一个分布式FTP扫描程序,以测试Map Reduce是否适合于小型数据集,网络密集型程序。最后,编写了一个分布式字符串搜索程序,以测试Map Reduce在大型数据集上的性能。结果表明,Map Reduce可以在小型集群上高效运行,并且还适用于小型数据集,网络I / O密集型程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号