首页> 中文期刊> 《计算机科学技术学报:英文版》 >AquaSee: Predict Load and Cooling System Faults of Supercomputers Using Chilled Water Data

AquaSee: Predict Load and Cooling System Faults of Supercomputers Using Chilled Water Data

         

摘要

An analysis of real-world operational data of Tianhe-1A(TH-1A)supercomputer system shows that chilled water data not only can reflect the status of a chiller system but also are related to supercomputer load.This study proposes AquaSee,a method that can predict the load and cooling system faults of supercomputers by using chilled water pressure and temperature data.This method is validated on the basis of real-world operational data of the TH-1A supercomputer system at the National Supercomputer Center in Tianjin.Datasets with various compositions are used to construct the prediction model,which is also established using different prediction sequence lengths.Experimental results show that the method that uses a combination of pressure and temperature data performs more effectively than that only consisting of either pressure or temperature data.The best inference sequence length is two points.Furthermore,an anomaly monitoring system is set up by using chilled water data to help engineers detect chiller system anomalies.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号