...
首页> 外文期刊>Journal of vision >Consistent frequency-based sound matches to natural visual scenes
【24h】

Consistent frequency-based sound matches to natural visual scenes

机译:一致的基于频率的声音匹配自然的视觉场景

获取原文
           

摘要

We previously demonstrated a consistent relationship between visual spatial-frequency and auditory amplitude-modulation (AM) frequency, in which Gabors of 0.5a??8 cycles/degree (c/d) were linearly matched to auditory AM frequencies of 1a??12 Hz (Guzman et al., VSS 2009). Here, we investigated whether similar crossmodal associations occur for natural scenes, which are dominated by various spatial-frequency components. We asked whether people consistently match specific auditory AM frequencies to photographed scenes, and if so, how these crossmodal matches are associated with the dominant spatial-frequency component and subjective impression (dense, stimulating) of the scene. We found that eighteen observers matched specific auditory AM frequencies to 26 scenes from diverse categories (nature, urban, indoor) with surprising consistency. We applied a 2D Fourier transform to each scene to determine the contrast energy for 12 spatial-frequency bins ranging 0.05a??12.8 c/d. Interestingly, scenes with higher contrast energy in the mid-spatial-frequency range 0.5a??1.25 c/d were matched to faster AM frequencies, whereas other spatial-frequency components did not contribute to AM frequency matches. Analysis of our images suggests that scenes with stronger mid-spatial-frequency components appear to have numerous object boundaries. Thus, the results suggest a crossmodal association between the visual coding of multiple object boundaries and the auditory coding of AM frequency. Furthermore, a multiple regression of AM frequency matches to subjective scene ratings (obtained after the experiment) indicates that dense (vs. sparse) and stimulating (vs. calm) ratings independently contribute to faster AM frequency matches. Based on the spatial-frequency analysis and subjective ratings, our results demonstrate an association between visual object density and faster auditory AM frequencies in scene perception, and that visual features conveying stimulating content additionally contribute to faster AM frequency matches.
机译:先前我们证明了视觉空间频率与听觉振幅调制(AM)频率之间的一致性关系,其中0.5a ?? 8个循环/度(c / d)的Gabors与1a ?? 12的听觉AM频率线性匹配Hz(Guzman等,VSS 2009)。在这里,我们调查了自然场景是否发生了类似的交叉峰关联,而自然场景受各种空间频率分量的支配。我们询问人们是否始终将特定的听觉AM频率与拍摄的场景相匹配,如果是,这些交叉模态匹配如何与场景的主要空间频率成分和主观印象(密集,刺激)相关联。我们发现,十八名观察员以令人惊讶的一致性将特定的听觉AM频率与来自不同类别(自然,城市,室内)的26个场景进行了匹配。我们对每个场景应用了二维傅立叶变换,以确定12个空间频率范围为0.05a≤12.8c / d的仓的对比度能量。有趣的是,在中空频率范围0.5a ?? 1.25 c / d中具有较高对比度能量的场景与较快的AM频率相匹配,而其他空间频率分量对AM频率匹配无贡献。对图像的分析表明,具有较高中频成分的场景似乎具有许多对象边界。因此,结果表明在多个物体边界的视觉编码和AM频率的听觉编码之间存在交叉模式关联。此外,AM频率匹配与主观场景评级(在实验后获得)的多元回归表明,密集(相对于稀疏)和刺激(相对于平静)评级独立地有助于更快的AM频率匹配。基于空间频率分析和主观评级,我们的结果表明视觉对象密度与场景感知中更快的听觉AM频率之间存在关联,并且传达刺激性内容的视觉特征还有助于实现更快的AM频率匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号