【24h】

Spruce: A System for Supporting Urgent High-Performance Computing

机译:云杉:支持紧急高性能计算的系统

获取原文
获取原文并翻译 | 示例

摘要

Modeling and simulation using high-performance computing are playing an increasingly important role in decision making and prediction. For time-critical emergency decision support applications, such as influenza modeling and severe weather prediction, late results may be useless. A specialized infrastructure is needed to provide computational resources quickly. This paper describes the architecture and implementation of SPRUCE, a system for supporting urgent computing on both traditional supercomputers and distributed computing Grids. Currently deployed on the TeraGrid, SPRUCE provides users with "right-of-way tokens" that can be activated from a Web-based portal or Web service invocation in the event of an urgent computing need. Tokens are trans-ferrable and can be restricted to specific resource sets and priority levels. Once a session is activated, job submissions may request elevated priority. Based on local policy, computing resources can respond, for example, by preempting active jobs or raising the job's priority in the queue. This paper also explores the strengths and weaknesses of the SPRUCE architecture and token-based activation for urgent computing applications.
机译:使用高性能计算的建模和仿真在决策和预测中起着越来越重要的作用。对于时间紧迫的紧急决策支持应用程序(例如流感建模和恶劣天气预测),后期结果可能没有用。需要专门的基础架构来快速提供计算资源。本文描述了SPRUCE的体系结构和实现,SPRUCE是在传统的超级计算机和分布式计算网格上都支持紧急计算的系统。 SPRUCE当前部署在TeraGrid上,为用户提供“通行权令牌”,如果有紧急计算需求,可以从基于Web的门户或Web服务调用中激活它。令牌是可转让的,可以限制为特定的资源集和优先级。激活会话后,作业提交可以请求更高的优先级。根据本地策略,计算资源可以通过抢占活动作业或提高作业在队列中的优先级进行响应。本文还探讨了SPRUCE体系结构和针对紧急计算应用程序的基于令牌的激活的优缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号