首页> 外文期刊>Pattern recognition letters >Multiclass classification of distributed memory parallel computations
【24h】

Multiclass classification of distributed memory parallel computations

机译:分布式内存并行计算的多类分类

获取原文
获取原文并翻译 | 示例
       

摘要

High Performance Computing (HPC) is a field concerned with solving large-scale problems in science and engineering. However, the computational infrastructure of HPC systems can also be misused as demonstrated by the recent commoditization of cloud computing resources on the black market. As a first step towards addressing this, we introduce a machine learning approach for classifying distributed parallel computations based on communication patterns between compute nodes. We first provide relevant background on message passing and computational equivalence classes called dwarfs and describe our exploratory data analysis using self organizing maps. We then present our classification results across 29 scientific codes using Bayesian networks and compare their performance against Random Forest classifiers. These models, trained with hundreds of gigabytes of communication logs collected at Lawrence Berkeley National Laboratory, perform well without any a priori information and address several shortcomings of previous approaches.
机译:高性能计算(HPC)是与解决科学和工程学中的大规模问题有关的领域。但是,HPC系统的计算基础架构也可能被滥用,正如最近在黑市上云计算资源的商品化所证明的那样。作为解决此问题的第一步,我们介绍了一种基于学习节点之间的通信模式对分布式并行计算进行分类的机器学习方法。我们首先提供有关消息传递和称为“矮人”的计算等价类的相关背景,并使用自组织映射描述我们的探索性数据分析。然后,我们使用贝叶斯网络在29个科学代码中显示分类结果,并将其与随机森林分类器的性能进行比较。这些模型经过劳伦斯·伯克利国家实验室(Lawrence Berkeley National Laboratory)收集的数百GB通讯日志的训练,在没有任何先验信息的情况下表现良好,并解决了以前方法的一些缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号