首页> 美国卫生研究院文献>PLoS Computational Biology >A hierarchical sparse coding model predicts acoustic feature encoding in both auditory midbrain and cortex
【2h】

A hierarchical sparse coding model predicts acoustic feature encoding in both auditory midbrain and cortex

机译:分层稀疏编码模型可预测听觉中脑和皮层的声学特征编码

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The auditory pathway consists of multiple stages, from the cochlear nucleus to the auditory cortex. Neurons acting at different stages have different functions and exhibit different response properties. It is unclear whether these stages share a common encoding mechanism. We trained an unsupervised deep learning model consisting of alternating sparse coding and max pooling layers on cochleogram-filtered human speech. Evaluation of the response properties revealed that computing units in lower layers exhibited spectro-temporal receptive fields (STRFs) similar to those of inferior colliculus neurons measured in physiological experiments, including properties such as sound onset and termination, checkerboard pattern, and spectral motion. Units in upper layers tended to be tuned to phonetic features such as plosivity and nasality, resembling the results of field recording in human auditory cortex. Variation of the sparseness level of the units in each higher layer revealed a positive correlation between the sparseness level and the strength of phonetic feature encoding. The activities of the units in the top layer, but not other layers, correlated with the dynamics of the first two formants (F1, F2) of all phonemes, indicating the encoding of phoneme dynamics in these units. These results suggest that the principles of sparse coding and max pooling may be universal in the human auditory pathway.
机译:听觉途径包括从耳蜗核到听觉皮层的多个阶段。在不同阶段起作用的神经元具有不同的功能并表现出不同的响应特性。尚不清楚这些阶段是否共享通用的编码机制。我们训练了一个无监督的深度学习模型,该模型由在经耳蜗图过滤的人类语音上交替稀疏编码和最大池化层组成。对响应特性的评估表明,较低层中的计算单元表现出的光谱时空接受域(STRF)与生理实验中测得的下丘神经元的相似,包括诸如声音起跳和终止,棋盘图案和频谱运动的属性。上层的单位倾向于被调整为语音功能,如单元格和鼻音,类似于人类听觉皮层中的现场记录结果。每个较高层中单元的稀疏度的变化表明稀疏度与语音特征编码的强度之间呈正相关。顶层(而不是其他层)中单元的活动与所有音素的前两个共振峰(F1,F2)的动力学相关,指示这些单元中音素动力学的编码。这些结果表明,稀疏编码和最大合并的原理在人类听觉途径中可能是普遍的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号