...
首页> 外文期刊>Journal of Parallel and Distributed Computing >Transparent runtime parallelization of the R scripting language
【24h】

Transparent runtime parallelization of the R scripting language

机译:R脚本语言的透明运行时并行化

获取原文
获取原文并翻译 | 示例
           

摘要

Scripting languages such as R and Matlab are widely used in scientific data processing. As the data volume and the complexity of analysis tasks both grow, sequential data processing using these tools often becomes the bottleneck in scientific workflows. We describe pR, a runtime framework for automatic and transparent parallelization of the popular R language used in statistical computing. Recognizing scripting languages' interpreted nature and data analysis codes' use pattern, we propose several novel techniques: (1) applying parallelizing compiler technology to runtime, whole-program dependence analysis of scripting languages, (2) incremental code analysis assisted with evaluation results, and (3) runtime parallelization of file accesses. Our framework does not require any modification to either the source code or the underlying R implementation. Experimental results demonstrate that pR can exploit both task and data parallelism transparently and overall has better performance as well as scalability compared to an existing parallel R package that requires code modification.
机译:R和Matlab等脚本语言已广泛用于科学数据处理中。随着数据量和分析任务复杂性的增长,使用这些工具进行顺序数据处理通常成为科学工作流程的瓶颈。我们描述了pR,这是一种用于统计计算中流行的R语言的自动和透明并行化的运行时框架。认识到脚本语言的解释性质和数据分析代码的使用模式,我们提出了几种新颖的技术:(1)将并行化编译器技术应用于运行时,对脚本语言进行全程序依赖性分析,(2)辅助评估评估结果的增量代码, (3)文件访问的运行时并行化。我们的框架不需要对源代码或基础R实现进行任何修改。实验结果表明,与现有的需要代码修改的并行R包相比,pR可以透明地利用任务和数据并行性,并且总体上具有更好的性能和可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号