Centre for Quantum Software and Information;
University of Technology Sydney;
NSW 2007;
Australia;
State Key Laboratory of Computer Science;
Institute of Software;
Chinese Academy of Sciences;
Beijing 100190;
China;
Department of Computer Science and Technology;
Tsinghua University;
Beijing 100084;
China;
Quantum Markov decision processes; quantum machine learning; reinforcement learning; dynamic programming; decision making;