SCOUT: A Monitor and Profiler of Grid Resources for Large-Scale Scientific Computing

机译：SCOUT：用于大型科学计算的网格资源监视和分析器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Computational Grids consist of heterogeneous collections of geographically distributed computing resources and have supported numerous scientific applications that require substantial amounts of computing power and storage space. From the point of view of scientists who want to leverage these Grid computing resources, effectively locating appropriate computing resources with minimized allocation overheads is crucial to successfully execute large-scale scientific applications. However, Grid resource availability is highly unstable and current Grid Information Service (GIS) does not provide accurate state information of computing resources. This can make it very difficult for users and systems (Schedulers, Resource brokers) to schedule the jobs in the Grid system and to map tasks on appropriate available resources. In this paper, we present SCOUT system that can provide scientific users with current state information about Grid computing resources including the number of available CPU cores and average response time to get resources allocated. With the help of SCOUT, we can periodically profile resource availability of the Computing Elements (CE) in Grids and monitor their average response time and performance. It provides a mechanism to find out the number of available CPU cores required for the applications to execute their tasks within shortest expected time which can accelerate the productivity of leveraging Grid computing resources for solving complex and challenging scientific problems. We have performed resource profiling based on SCOUT system on two different VO(Virtual Organization)s during one month period and based on that information, we could successfully perform large-scale drug repositioning simulations over 2,000 CPU cores.

机译：计算网格由地理分布的计算资源的异构集合组成，并支持了需要大量计算能力和存储空间的众多科学应用。从希望利用这些Grid计算资源的科学家的角度来看，以最小的分配开销有效地定位适当的计算资源对于成功执行大规模科学应用至关重要。但是，网格资源可用性非常不稳定，并且当前的网格信息服务（GIS）无法提供计算资源的准确状态信息。这会使用户和系统（计划程序，资源代理）很难调度Grid系统中的作业，并难以在适当的可用资源上映射任务。在本文中，我们介绍了SCOUT系统，该系统可以为科学用户提供有关网格计算资源的当前状态信息，包括可用CPU内核的数量和平均响应时间以获取资源分配。借助SCOUT，我们可以定期剖析Grid中计算元素（CE）的资源可用性，并监视其平均响应时间和性能。它提供了一种机制，可以找出应用程序在最短的预期时间内执行任务所需的可用CPU内核数量，从而可以利用网格计算资源来解决复杂且具有挑战性的科学问题，从而提高生产率。我们在一个月的时间内对两个不同的VO（虚拟组织）进行了基于SCOUT系统的资源剖析，并根据该信息成功地对2,000个CPU内核进行了大规模的药物重新定位模拟。

著录项

来源
《International Conference on Cloud and Autonomic Computing》|2015年|260-267|共8页
会议地点
作者
Hossain Md Azam; Hieu Trong Vu; Jik-Soo Kim; Myungho Lee; Soonwook Hwang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
grid computing; resource allocation; scheduling; virtualisation; CE; GIS; SCOUT system; VO; computing elements; grid computing resource; grid information service; grid resource monitor; grid resource profiler; job scheduling; scientific computing; virtual organization; Databases; Instruction sets; Monitoring; Organizations; Processor scheduling; Scheduling; Time factors;

机译：网格计算;资源分配;调度;虚拟化; CE; GIS; SCOUT系统; VO;计算元素;网格计算资源;网格信息服务;网格资源监控器;网格资源剖析器;作业调度;科学计算;虚拟组织;数据库;指令集;监控;组织;处理器调度;调度;时间因素;

相似文献

外文文献
中文文献
专利

1. Exploiting resource profiling mechanism for large-scale scientific computing on grids [J] . Hossain Md. Azam, Cao Ngoc Nguyen, Kim Jik-Soo, Cluster computing . 2016,第3期

机译：利用资源剖析机制进行大规模网格科学计算
2. A Grid Portal to Support High-Performance Scientific Computing on Distributed Resources [J] . Jacobo TARRIO, Juan TOURINO, Maria J. MARTIN, IEICE Transactions on Information and Systems . 2004,第7期

机译：一个网格门户，可支持分布式资源的高性能科学计算
3. A New Mechanism For Resource Monitoring In Grid Computing [J] . Wu-Chun Chung, Ruay-Shiung Chang Future generation computer systems . 2009,第1期

机译：网格计算中资源监视的新机制
4. SCOUT: A Monitor and Profiler of Grid Resources for Large-Scale Scientific Computing [C] . Hossain Md Azam, Hieu Trong Vu, Jik-Soo Kim, International Conference on Cloud and Autonomic Computing . 2015

机译：SCOUT：大型科学计算的网格资源的监视器和分析器
5. A globally distributed grid monitoring system to facilitate high-performance computing at DO/SAM-grid (design, development, implementation and deployment of a prototype). [D] . Rana, Abhishek Singh. 2002

机译：全球分布的网格监视系统，可促进DO / SAM网格的高性能计算（原型的设计，开发，实施和部署）。
6. Integrating Clinical Trial Imaging Data Resources Using Service-Oriented Architecture and Grid Computing [O] . Stefan Baumann El-Ghatta, Thierry Cladé, Joshua C. Snyder -1

机译：使用面向服务的体系结构和网格计算集成临床试验成像数据资源
7. Dynamically reconfigurable scientific computing on large-scale heterogeneous grids [O] . Boleslaw Szymanski, Carlos Varela, John Cummings, 2003

机译：大规模异构网格上的动态可重构科学计算
8. Using Computing and Data Grids for Large-Scale Science and Engineering [R] . Johnson, W. E. 2001

机译：将计算和数据网格用于大规模科学和工程

SCOUT: A Monitor and Profiler of Grid Resources for Large-Scale Scientific Computing

摘要

著录项

相似文献

相关主题

期刊订阅