首页> 美国卫生研究院文献>other >Choice as a Function of Reinforcer Hold: From Probability Learning to Concurrent Reinforcement

【2h】

Choice as a Function of Reinforcer Hold: From Probability Learning to Concurrent Reinforcement

机译：选择增强保持功能：从概率学习到并行增强

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Two procedures commonly used to study choice are concurrent reinforcement and probability learning. Under concurrent-reinforcement procedures, once a reinforcer is scheduled, it remains available indefinitely until collected. Therefore reinforcement becomes increasingly likely with passage of time or responses on other operanda. Under probability learning, reinforcer probabilities are constant and independent of passage of time or responses. Therefore a particular reinforcer is gained or not, on the basis of a single response, and potential reinforcers are not retained, as when betting at a roulette wheel. In the “real” world, continued availability of reinforcers often lies between these two extremes, with potential reinforcers being lost owing to competition, maturation, decay, and random scatter. The authors parametrically manipulated the likelihood of continued reinforcer availability, defined as hold, and examined the effects on pigeons’ choices. Choices varied as power functions of obtained reinforcers under all values of hold. Stochastic models provided generally good descriptions of choice emissions with deviations from stochasticity systematically related to hold. Thus, a single set of principles accounted for choices across hold values that represent a wide range of real-world conditions.

机译：通常用于研究选择的两种程序是并发强化和概率学习。在并发加固程序下，一旦计划了加固器，它将无限期保持可用状态，直到被收集为止。因此，随着时间的流逝或对其他操作的回应，加强工作变得越来越有可能。在概率学习中，增强器的概率是恒定的，并且与时间或响应的经过无关。因此，根据单个响应获得或不获得特定的增强剂，并且如在轮盘赌上进行下注时一样，潜在的增强剂没有保留。在“现实”世界中，增强剂的持续可用性通常介于这两个极端之间，由于竞争，成熟，衰变和随机散布，潜在的增强剂会丢失。作者从参数上控制了增强剂持续供应的可能性（定义为保持），并研究了对鸽子选择的影响。在所有保持值下，选择随获得的增强器的功能而变化。随机模型通常提供对选择排放的良好描述，而偏离随机性的系统性则与持有有关。因此，一套原则解释了代表各种现实条件的保持值之间的选择。

著录项

期刊名称 other
作者
Greg Jensen; Allen Neuringer;
展开▼
作者单位

展开▼
年(卷),期 -1(34),4
年度 -1
页码 437–460
总页数 40
原文格式 PDF
正文语种
中图分类
关键词
matching stochastic response limited hold reinforcement probability pigeons;

机译：配对;随机反应;有限持球;增援概率;鸽子;

相似文献

外文文献
中文文献
专利

1. Choice as a Function of Reinforcer "Hold": From Probability Learning to Concurrent Reinforcement [J] . Jensen G, Neuringer A Journal of experimental psychology. Animal behavior processes . 2008,第4期

机译：增强器“保持”功能的选择：从概率学习到并行增强
2. A Reinforcement Learning Method Using a Dynamic Reinforcement Function Based on Action Selection Probability [J] . Yugo Hasegawa, Satoko Takada, Hidehiro Nakano, Systems and Computers in Japan . 2007,第7期

机译：基于动作选择概率的动态强化函数强化学习方法
3. Reinforcement Learning for Continuous Stochastic Actions: An Approximation of Probability Density Function by Orthogonal Wave Function Expansion [J] . Hideki SATOH IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2006,第8期

机译：连续随机动作的强化学习：通过正交波函数展开的概率密度函数逼近
4. Reward Function and Initial Values: Better Choices for Accelerated Goal-Directed Reinforcement Learning [C] . Laeetitia Matignon, Guillaume J. Laurent, Nadine Le Fort-Piat International Conference on Artificial Neural Networks(ICANN 2006) pt.1; 20060910-14; Athens(GR) . 2006

机译：奖励功能和初始值：加速目标导向的强化学习的更好选择
5. Choice Dynamics in Concurrent Ratio Schedules of Reinforcement [D] . Bell-Garrison, Daniel. 2018

机译：并发配比计划中的选择动态
6. Concurrent schedules of interresponse time reinforcement: probability of reinforcement and the lower bounds of the reinforced interresponse time intervals [O] . Richard W. Malott, William W. Cumming 1966

机译：响应时间强化的并发时间表：强化的概率和强化的响应时间间隔的下限
7. Concurrent schedules of interresponse time reinforcement: probability of reinforcement and the lower bounds of the reinforced interresponse time intervals1 [O] . Malott, Richard W., Cumming, William W. 1966

机译：并发响应时间强化的并发时间表：强化的可能性和强化的响应时间间隔的下限1
8. Choice Probabilities and Choice Functions. [R] . Fishburn, P. C. 1978

机译：选择概率和选择函数。

Choice as a Function of Reinforcer Hold: From Probability Learning to Concurrent Reinforcement

摘要

著录项

相似文献

相关主题

期刊订阅