Beijing Key Laboratory of Work Safety Intelligent Monitoring, Beijing University of Posts and Telecommunications, Beijing, 100876, China;
Alibaba Cloud Computing, Hangzhou, 311121, China;
Department of Systems and Computer Engineering, Carleton University, Ottawa, K1S 5B6, Canada;
D2D communication; resource allocation; power control; multi-agent; Q-learning; fuzzy C-means;