首页> 外文会议>2010 IEEE International Conference on Cluster Computing >Host Side Dynamic Reconfiguration with InfiniBand
【24h】

Host Side Dynamic Reconfiguration with InfiniBand

机译:使用InfiniBand进行主机端动态重新配置

获取原文
获取外文期刊封面目录资料

摘要

Rerouting around faulty components and migration of jobs both require reconfiguration of data structures in the Queue Pairs residing in the hosts on an InfiniBand cluster. In this paper we report an implementation of dynamic reconfiguration of such host side data-structures. Our implementation preserves the Queue Pairs, and lets the application run without being interrupted. With this implementation, we demonstrate a complete solution to fault tolerance in an InfiniBand network, where dynamic network reconfiguration to a topology-agnostic routing function is used to avoid malfunctioning components. This solution is in principle able to let applications run uninterruptedly on the cluster, as long as the topology is physically connected. Through measurements on our test-cluster we show that the increased cost of our method in setup latency is negligible, and that there is only a minor reduction in throughput during reconfiguration.
机译:REROUTING周围的故障组件和作业的迁移都需要重新配置驻留在INFINIBAND群集上的主机中的队列对中的数据结构。在本文中,我们报告了这种主机侧数据结构的动态重新配置的实现。我们的实现保留了队列对,并允许应用程序运行而不会中断。通过这种实现,我们向InfiniBand网络中的容错解决了一个完整的解决方案,其中用于拓扑无神不可转的路由功能的动态网络重新配置用于避免故障组件。此解决方案原则上能够让应用程序在群集中不间断地运行,只要拓扑物理连接。通过对我们的测试群集进行测量,我们显示我们在设置延迟中的方法的增加的成本可以忽略不计,并且在重新配置期间吞吐量只有轻微降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号