
Saccade selection when reward probability is dynamically manipulated using Markov chains


Abstract

Markov chains (stochastic processes where probabilities are assigned based on the previous outcome) are commonly used to examine the transitions between behavioral states, such as those that occur during foraging or social interactions. However, relatively little is known about how well primates can incorporate knowledge about Markov chains into their behavior. Saccadic eye movements are an example of a simple behavior influenced by information about probability, and thus are good candidates for testing whether subjects can learn Markov chains. In addition, when investigating the influence of probability on saccade target selection, the use of Markov chains could provide an alternative method that avoids confounds present in other task designs. To investigate these possibilities, we evaluated human behavior on a task in which stimulus reward probabilities were assigned using a Markov chain. On each trial, the subject selected one of four identical stimuli by saccade; after selection, feedback indicated the rewarded stimulus. Each session consisted of 200–600 trials, and on some sessions, the reward magnitude varied. On sessions with a uniform reward, subjects (n = 6) learned to select stimuli at a frequency close to reward probability, which is similar to human behavior on matching or probability classification tasks. When informed that a Markov chain assigned reward probabilities, subjects (n = 3) learned to select the greatest reward probability more often, bringing them close to behavior that maximizes reward. On sessions where reward magnitude varied across stimuli, subjects (n = 6) demonstrated preferences for both greater reward probability and greater reward magnitude, resulting in a preference for greater expected value (the product of reward probability and magnitude). These results demonstrate that Markov chains can be used to dynamically assign probabilities that are rapidly exploited by human subjects during saccade target selection.
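The abstract gives enough of the task structure to sketch a simulation: a Markov chain determines which of the four stimuli is rewarded on each trial, a "matching" subject chooses each stimulus at a frequency close to its current reward probability, and a maximizing subject always picks the greatest expected value (reward probability times magnitude). The Python sketch below illustrates that logic only; the transition matrix, reward magnitudes, session length, and the matching rule are illustrative assumptions, not the authors' actual parameters.

```python
import numpy as np

# Hypothetical 4-state transition matrix: row i gives the probability that each
# of the four stimuli is rewarded on the next trial, given that stimulus i was
# rewarded on the current trial. Values are illustrative, not from the paper.
TRANSITION = np.array([
    [0.10, 0.60, 0.15, 0.15],
    [0.15, 0.10, 0.60, 0.15],
    [0.15, 0.15, 0.10, 0.60],
    [0.60, 0.15, 0.15, 0.10],
])

MAGNITUDE = np.array([1.0, 1.0, 1.0, 1.0])  # uniform-reward session

def run_session(n_trials=400, maximize=False, rng=None):
    """Simulate one session in which the rewarded stimulus follows the Markov
    chain. A matching subject chooses in proportion to the current reward
    probabilities; a maximizing subject picks the greatest expected value."""
    if rng is None:
        rng = np.random.default_rng()
    rewarded = rng.integers(4)               # stimulus rewarded on the first trial
    earned = 0.0
    for _ in range(n_trials):
        p_next = TRANSITION[rewarded]        # current reward probabilities
        if maximize:
            choice = int(np.argmax(p_next * MAGNITUDE))  # greatest expected value
        else:
            choice = rng.choice(4, p=p_next)             # probability matching
        rewarded = rng.choice(4, p=p_next)   # feedback: which stimulus pays out
        if choice == rewarded:
            earned += MAGNITUDE[rewarded]
    return earned
```

Comparing run_session(maximize=False) with run_session(maximize=True) reproduces, under these assumed parameters, the qualitative contrast the abstract reports: matching yields less reward than consistently selecting the stimulus with the greatest expected value.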
