机译:利用现有知识的奖励塑造,无人驾驶障碍避免避免
Shanghai Univ Sch Comp Engn & Sci Shanghai Peoples R China;
Shanghai Univ Sch Comp Engn & Sci Shanghai Peoples R China;
Shanghai Univ Sch Comp Engn & Sci Shanghai Peoples R China;
Shanghai Univ Sch Comp Engn & Sci Shanghai Peoples R China;
complex dynamic environment; obstacle avoidance; prior knowledge; reinforcement learning; reward shaping; unmanned surface vessel;