首页> 外国专利> Programming model and framework for providing resilient parallel tasks

Programming model and framework for providing resilient parallel tasks

机译:提供弹性并行任务的编程模型和框架

摘要

Exemplary embodiments herein describe programming models and frameworks for providing parallel and resilient tasks. Tasks are created in accordance with predetermined structures. Defined tasks are stored as data objects in a shared pool of memory that is made up of disaggregated memory communicatively coupled via a high performance interconnect that supports atomic operations as descried herein. Heterogeneous compute nodes are configured to execute tasks stored in the shared memory. When compute nodes fail, they do not impact the shared memory, the tasks or other data stored in the shared memory, or the other non-failing compute nodes. The non-failing compute nodes can take on the responsibility of executing tasks owned by other compute nodes, including tasks of a compute node that fails, without needing a centralized manager or schedule to re-assign those tasks. Task processing can therefore be performed in parallel and without impact from node failures.
机译:这里示例性实施例描述了用于提供并行和弹性任务的编程模型和框架。任务是根据预定结构创建的。定义的任务被存储为在共享存储器中的数据对象存储,该数据对象由通过支持本文描述的原子操作的高性能互连通信地耦合的分组存储器组成。异构计算节点被配置为执行存储在共享存储器中的任务。当计算节点失败时,它们不会影响存储在共享内存中的共享内存,任务或其他数据,或者其他非失败计算节点。非失败的计算节点可以承担执行其他计算节点拥有的任务的责任,包括失败的计算节点的任务,而无需将集中管理器或计划重新分配这些任务。因此,可以并行执行任务处理,而不会从节点故障的影响。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号