机译:设计时间中毒有限地平线马尔可夫决策过程
Air Force Inst Technol Dept Operat Sci 2950 Hobson Way Wright Patterson AFB OH 45433 USA;
Air Force Inst Technol Dept Operat Sci 2950 Hobson Way Wright Patterson AFB OH 45433 USA;
Air Force Studies Anal & Assessments 1690 Air Force Pentagon Washington DC 20330 USA;
Markov decision process; Adversarial learning; Data poisoning; Machine learning; Reinforcement learning;