首页> 外文会议>International Conference on Cloud and Autonomic Computing >SCOUT: A Monitor and Profiler of Grid Resources for Large-Scale Scientific Computing
【24h】

SCOUT: A Monitor and Profiler of Grid Resources for Large-Scale Scientific Computing

机译:SCOUT:大型科学计算的网格资源的监视器和分析器

获取原文

摘要

Computational Grids consist of heterogeneous collections of geographically distributed computing resources and have supported numerous scientific applications that require substantial amounts of computing power and storage space. From the point of view of scientists who want to leverage these Grid computing resources, effectively locating appropriate computing resources with minimized allocation overheads is crucial to successfully execute large-scale scientific applications. However, Grid resource availability is highly unstable and current Grid Information Service (GIS) does not provide accurate state information of computing resources. This can make it very difficult for users and systems (Schedulers, Resource brokers) to schedule the jobs in the Grid system and to map tasks on appropriate available resources. In this paper, we present SCOUT system that can provide scientific users with current state information about Grid computing resources including the number of available CPU cores and average response time to get resources allocated. With the help of SCOUT, we can periodically profile resource availability of the Computing Elements (CE) in Grids and monitor their average response time and performance. It provides a mechanism to find out the number of available CPU cores required for the applications to execute their tasks within shortest expected time which can accelerate the productivity of leveraging Grid computing resources for solving complex and challenging scientific problems. We have performed resource profiling based on SCOUT system on two different VO(Virtual Organization)s during one month period and based on that information, we could successfully perform large-scale drug repositioning simulations over 2,000 CPU cores.
机译:计算网格包括地理分布式计算资源的异构集合,并支持许多需要大量计算能力和存储空间的科学应用。从想要利用这些电网计算资源的科学家的角度来看,有效地定位适当的计算资源,最小化分配开销是成功执行大规模科学应用的关键。但是,网格资源可用性高度不稳定,并且当前网格信息服务(GIS)不提供计算资源的准确状态信息。这可以使用户和系统(调度员,资源代理)非常困难,以在网格系统中安排作业并在适当的可用资源上映射任务。在本文中,我们呈现了侦察系统,可以提供关于网格计算资源的当前状态信息的科学用户,包括可用CPU内核的数量和获得分配资源的平均响应时间。在Scout的帮助下,我们可以定期配置网格中计算元素(CE)的资源可用性,并监控其平均响应时间和性能。它提供了一种机制,找出应用程序在最短预期时间内执行其任务所需的可用CPU核心的数量,这可以加速利用电网计算资源的生产率来解决复杂和具有挑战性的科学问题。我们在一个月期间基于两个不同的VO(虚拟组织)S的SCOUT系统进行了资源分析,并基于该信息,我们可以成功地执行超过2,000个CPU内核的大规模药物重新定位模拟。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号