首页> 外文期刊>Frontiers in Psychology >Understanding Human Decision-Making in an Interactive Landslide Simulator Tool via Reinforcement Learning
【24h】

Understanding Human Decision-Making in an Interactive Landslide Simulator Tool via Reinforcement Learning

机译:通过加固学习了解交互式滑坡模拟器工具中的人为决策

获取原文
           

摘要

Prior research has used an Interactive Landslide Simulator (ILS) tool to investigate human decision-making against landslide risks. It has been found that repeated feedback in ILS tool about damages due to landslides causes an improvement in human decisions against landslide risks. However, little is known on how theories of learning from feedback (e.g., reinforcement learning) would account for human decisions in the ILS tool. The primary goal of this paper is to account for human decisions in the ILS tool via computational models based upon reinforcement learning and to investigate the underlying cognitive processes involved when people make decisions in the ILS tool. Four different reinforcement-learning models were developed and evaluated in their ability in capturing human decisions in an experiment involving two conditions in the ILS tool. The parameters of an Expectancy-Valence (EV) model, two Prospect-Valence-Learning models (PVL and PVL-2), a combination EV-PU model, and a random model were calibrated to human decisions in the ILS tool across the two conditions. Later, different models with their calibrated parameters were generalized to data collected in an experiment involving a new condition in ILS. When generalized to this new condition, the PVL-2 model’s parameters of both damage-feedback conditions outperformed all other RL models (including the random model). We highlight the implications of our results for decision-making against landslide risks.
机译:现有研究使用了交互式滑坡模拟器(ILS)工具来调查人力抵抗滑坡风险。有人发现,ILS工具中的反复反馈关于由于山体滑坡导致的损坏导致对抗滑坡风险的人类决策的改善。然而,对于从反馈(例如,强化学习)的学习理论知之甚少,都将占ILS工具中的人类决策。本文的主要目标是通过基于加强学习的计算模型,并根据人们在ILS工具中做出决定时,通过计算模型来解释ILS工具中的人为决定。在涉及ILS工具中的两个条件的实验中,开发了四种不同的增强学习模型和评估其能力。期望值(EV)模型,两个前景 - 价学习模型(PVL和PVL-2),组合EV-PU模型和随机模型的参数被校准到两者的ILS工具中的人为决定状况。后来,具有校准参数的不同模型是推广到涉及ILS新条件的实验中收集的数据。当广义到这种新条件时,PVL-2模型的损坏反馈条件的参数占所有其他RL模型(包括随机模型)。我们突出了我们对山体滑坡风险的决策结果的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号