Journal: Frontiers in Psychology

Understanding Human Decision Making in an Interactive Landslide Simulator Tool via Reinforcement Learning



Abstract

Prior research has used an Interactive Landslide Simulator (ILS) tool to investigate human decision making against landslide risks. Repeated feedback in the ILS tool about damages due to landslides has been found to improve human decisions against landslide risks. However, little is known about how theories of learning from feedback (e.g., reinforcement learning) would account for human decisions in the ILS tool. The primary goal of this paper is to account for human decisions in the ILS tool via computational models based upon reinforcement learning (RL) and to explore the model mechanisms involved when people make decisions in the ILS tool. Four different RL models were developed and evaluated on their ability to capture human decisions in an experiment involving two conditions in the ILS tool. The parameters of an Expectancy-Valence (EV) model, two Prospect-Valence-Learning models (PVL and PVL-2), a combination EV-PU model, and a random model were calibrated to human decisions in the ILS tool across the two conditions. The models, with their calibrated parameters, were then generalized to data collected in an experiment involving a new condition in the ILS tool. When generalized to this new condition, the PVL-2 model, with parameters calibrated in each of the two damage-feedback conditions, outperformed all other models (including the random model). We highlight the implications of our results for decision making against landslide risks.
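For readers unfamiliar with the model family named in the abstract, the following is a minimal sketch of the standard Expectancy-Valence formulation as it is commonly stated in the decision-making literature. It is not the paper's ILS-specific implementation, which the abstract does not describe. The function names, the two-option setup, and the payoff numbers are all illustrative assumptions.

```python
import math

def ev_valence(gain, loss, w):
    """Weighted valence of an outcome; w (0..1) is the attention weight on losses."""
    return (1.0 - w) * gain - w * abs(loss)

def ev_update(expectancies, choice, valence, a):
    """Delta-rule update of the chosen option only; a (0..1) is the learning rate."""
    updated = list(expectancies)
    updated[choice] += a * (valence - updated[choice])
    return updated

def ev_choice_probs(expectancies, trial, c):
    """Softmax choice rule with trial-dependent sensitivity theta = (trial / 10) ** c.

    Trials are assumed to be numbered from 1.
    """
    theta = (trial / 10.0) ** c
    scaled = [theta * e for e in expectancies]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [x / total for x in exps]

# Hypothetical example: two options, option 0 is chosen and yields
# a gain of 10 and a loss of 4 (made-up numbers, not from the paper).
E = [0.0, 0.0]
v = ev_valence(gain=10.0, loss=-4.0, w=0.5)
E = ev_update(E, choice=0, valence=v, a=0.2)
probs = ev_choice_probs(E, trial=10, c=1.0)
```

Calibrating such a model to human data typically means searching for the values of w, a, and c that make the predicted choice probabilities best match observed choices; how the paper does this for the ILS tool is not stated in the abstract.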
