Learning machines; Multiagent systems; Algorithms; Behavior; Bias; Classification; Decision making; Feedback; Information retrieval; Man computer interface; Markov processes; Mathematical models; Network architecture; Probability; Semantics; Signal processing; Training; User needs; Reinforcement learning; Modeling user behavior; End-user programming; Human-agent interactions; Interactive machine learning; Human teachers;
机译:从人的奖励中构筑强化学习:奖励积极性,暂时性打折,流行和表现
机译:平均对折奖励时间差异学习
机译:人类远见的可分离元素:腹侧额叶在构筑未来时发挥作用,但在降低未来收益中不起作用。
机译:从人的奖励中强化学习:情节任务的折扣
机译:用于模拟情景记忆的新型神经体系结构:远距奖励学习的计算研究。
机译:人体纹状体奖励价值的短期时间折现
机译:从人类奖励中学习强化学习:奖励积极性,时间贴现,情节性和表现