首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing >Using the Cloud for Parameter Estimation Problems: Comparing Spark vs MPI with a Case-Study
【24h】

Using the Cloud for Parameter Estimation Problems: Comparing Spark vs MPI with a Case-Study

机译:使用云解决参数估计问题:比较Spark与MPI与案例研究

获取原文

摘要

Systems biology is an emerging approach focused in generating new knowledge about complex biological systems by combining experimental data with mathematical modeling and advanced computational techniques. Many problems in this field are extremely challenging and require substantial supercomputing resources to be solved. This is the case of parameter estimation in large-scale nonlinear dynamic systems biology models. Recently, Cloud Computing has emerged as a new paradigm for on-demand delivery of computing resources. However, scientific computing community has been quite hesitant in using the Cloud, simply because traditional programming models do not fit well with the new paradigm, and the earliest cloud programming models do not allow most scientific computations being efficiently run in the Cloud. In this paper we explore and compare two distributed computing models: the MPI (message-passing interface) model, that is high-performance oriented, and the Spark model, which is throughput oriented but outperforms other cloud programming solutions adding improved support for iterative algorithms through in-memory computing. The performance of a very well known metaheuristic, the Differential Evolution algorithm, has been thoroughly assessed using a challenging parameter estimation problem from the domain of computational systems biology. The experiments have been carried out both in a local cluster and in the Microsoft Azure public cloud, allowing performance and cost evaluation for both infrastructures.
机译:系统生物学是一种新兴的方法,致力于通过将实验数据与数学建模和先进的计算技术相结合来生成有关复杂生物系统的新知识。该领域中的许多问题极具挑战性,需要解决大量的超级计算资源。在大型非线性动态系统生物学模型中,参数估计就是这种情况。最近,云计算已经成为按需交付计算资源的新范例。但是,科学计算社区在使用云时一直很犹豫,这仅仅是因为传统的编程模型不能很好地适应新的范例,并且最早的云编程模型不允许大多数科学计算在云中有效运行。在本文中,我们探索并比较了两种分布式计算模型:面向高性能的MPI(消息传递接口)模型和面向吞吐量但优于其他云编程解决方案的Spark模型,从而增加了对迭代算法的支持通过内存计算。已经使用来自计算系统生物学领域的具有挑战性的参数估计问题彻底评估了一种非常著名的元启发式算法-差分进化算法的性能。实验已在本地群集和Microsoft Azure公共云中进行,从而可以评估两个基础结构的性能和成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号