Graphics Processing Unit (GPU) clusters are widely used for their high performance, but as the number of GPUs grows, their high power consumption reduces system reliability. To address this, a GPU cluster power consumption collection system is proposed, and a power collection and monitoring network for GPU clusters is designed based on a ZigBee Wireless Sensor Network (WSN). A collection communication protocol and a database storage system are also constructed, and running the system effectively avoids communication conflicts. Experimental results show that the monitoring system accurately measures the power consumption of each GPU in the cluster, with a measurement error below 1% and a packet loss rate below 0.005%.