首页> 外文会议>International Conference "Numerical Computations: Theory and Algorithms" >Automatic Discovery of the Communication Network Topology for Building a Supercomputer Model
【24h】

Automatic Discovery of the Communication Network Topology for Building a Supercomputer Model

机译:自动发现构建超级计算机模型的通信网络拓扑

获取原文

摘要

The Research Computing Center of Lomonosov Moscow State University is developing the Octotron software suite for automatic monitoring and mitigation of emergency situations in supercomputers so as to maximize hardware reliability. The suite is based on a software model of the supercomputer. The model uses a graph to describe the computing system components and their interconnections. One of the most complex components of a supercomputer that needs to be included in the model is its communication network. This work describes the proposed approach for automatically discovering the Ethernet communication network topology in a supercomputer and its description in terms of the Octotron model. This suite automatically detects computing nodes and switches, collects information about them and identifies their interconnections. The application of this approach is demonstrated on the "Lomonosov" and "Lomonosov-2" supercomputers.
机译:Lomonosov莫斯科国立大学的研究中心正在开发偶极电脑软件套件,用于在超级计算机上自动监测和减轻紧急情况,以最大限度地提高硬件可靠性。该套件基于超级计算机的软件模型。该模型使用图表来描述计算系统组件及其互连。需要包含在模型中的超级计算机的最复杂组件之一是其通信网络。这项工作描述了所提出的方法,用于在超级计算机中自动发现以太网通信网络拓扑的方法及其描述在八十八乐模型方面。此套件会自动检测计算节点和交换机,收集有关它们的信息并识别其互连。在“Lomonosov”和“Lomonosov-2”超级计算机上证明了这种方法的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号