首页> 美国卫生研究院文献>other >Probability matching in perceptrons: Effects of conditional dependence and linear nonseparability
【2h】

Probability matching in perceptrons: Effects of conditional dependence and linear nonseparability

机译:感知器中的概率匹配:条件依赖和线性不可分的影响

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Probability matching occurs when the behavior of an agent matches the likelihood of occurrence of events in the agent’s environment. For instance, when artificial neural networks match probability, the activity in their output unit equals the past probability of reward in the presence of a stimulus. Our previous research demonstrated that simple artificial neural networks (perceptrons, which consist of a set of input units directly connected to a single output unit) learn to match probability when presented different cues in isolation. The current paper extends this research by showing that perceptrons can match probabilities when presented simultaneous cues, with each cue signaling different reward likelihoods. In our first simulation, we presented up to four different cues simultaneously; the likelihood of reward signaled by the presence of one cue was independent of the likelihood of reward signaled by other cues. Perceptrons learned to match reward probabilities by treating each cue as an independent source of information about the likelihood of reward. In a second simulation, we violated the independence between cues by making some reward probabilities depend upon cue interactions. We did so by basing reward probabilities on a logical combination (AND or XOR) of two of the four possible cues. We also varied the size of the reward associated with the logical combination. We discovered that this latter manipulation was a much better predictor of perceptron performance than was the logical structure of the interaction between cues. This indicates that when perceptrons learn to match probabilities, they do so by assuming that each signal of a reward is independent of any other; the best predictor of perceptron performance is a quantitative measure of the independence of these input signals, and not the logical structure of the problem being learned.
机译:当业务代表的行为与业务代表环境中事件发生的可能性相匹配时,便发生概率匹配。例如,当人工神经网络匹配概率时,其输出单元中的活动等于存在刺激时过去的奖励概率。我们以前的研究表明,简单的人工神经网络(感知器,由直接连接到单个输出单元的一组输入单元组成)在孤立地呈现不同线索时会学习匹配概率。本论文通过显示感知器可以在同时提示时匹配概率,每个提示都表示不同的奖励可能性。在我们的第一个模拟中,我们同时呈现了多达四个不同的线索。由一个提示信号提示的奖励可能性与其他提示信号提示的可能性无关。感知器通过将每个提示作为有关奖励可能性的独立信息源来学习以匹配奖励概率。在第二个模拟中,我们通过使某些奖励概率依赖于提示交互来违反提示之间的独立性。我们通过将奖励概率基于四个可能线索中的两个线索的逻辑组合(AND或XOR)来实现。我们还改变了与逻辑组合相关的奖励的大小。我们发现,与提示之间相互作用的逻辑结构相比,后一种操作是感知器性能更好的预测指标。这表明当感知器学习匹配概率时,它们通过假设奖励的每个信号彼此独立来进行匹配。感知器性能的最佳预测指标是这些输入信号的独立性的定量度量,而不是所学问题的逻辑结构。

著录项

  • 期刊名称 other
  • 作者单位
  • 年(卷),期 -1(12),2
  • 年度 -1
  • 页码 e0172431
  • 总页数 13
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号