A temporal saliency map for modeling auditory attention


Abstract

The auditory system is flooded with information throughout our daily lives. Rather than processing all of this information, we selectively shift our attention to particular auditory events: either events of interest (top-down attention) or events that capture our attention exogenously (bottom-up attention). In this work, we are concerned with the bottom-up, stimulus-driven aspects of human attention. The saliency of an auditory event is measured by how much the event differs from the sounds that precede it in time. To calculate this, we propose a novel auditory saliency map that is defined only over time. The proposed model contrasts with previously published auditory saliency maps, which treat the two-dimensional time-frequency spectrogram as an image that can be analyzed with visual saliency models. Instead, our model capitalizes on the rich, high-dimensional feature space that defines auditory events, where each acoustic dimension is processed across multiple scales. The resulting normalized feature maps are then combined over time into a single temporal saliency map. The peaks of the temporal saliency map indicate the locations of the salient events in the auditory scene. We validate the accuracy of the proposed model in simulated test scenarios with simple and complex sound clips. By exploiting the unique aspects of auditory processing that cannot be readily captured by visual processes, we are able to outperform other auditory saliency models, while also highlighting the commonalities and differences between the two modalities in processing salient events in everyday scenes.
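
The pipeline outlined in the abstract (per-feature processing at several scales, normalization, combination over time, peak picking) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the acoustic features, the scales, or the normalization, so the moving-average context, the max-normalization, the window sizes in `scales`, and the function names `temporal_saliency` and `salient_frames` are all assumptions chosen for illustration.

```python
import numpy as np


def preceding_mean(x, width):
    """Mean of the `width` frames that strictly precede each frame.

    The start of the signal is padded with its first value, so frame 0
    is compared against itself (an assumption for this sketch).
    """
    padded = np.concatenate([np.full(width, x[0]), x[:-1]])
    kernel = np.ones(width) / width
    return np.convolve(padded, kernel, mode="valid")


def temporal_saliency(features, scales=(2, 4, 8)):
    """Combine feature tracks into one saliency curve over time.

    features: array of shape (n_features, n_frames), one row per
              acoustic dimension (hypothetical, e.g. energy or pitch).
    scales:   context window lengths in frames (assumed values).
    """
    features = np.asarray(features, dtype=float)
    n_features, n_frames = features.shape
    saliency = np.zeros(n_frames)
    for track in features:
        for width in scales:
            # How much each frame differs from the sound preceding it.
            contrast = np.abs(track - preceding_mean(track, width))
            # Max-normalize each map so no feature or scale dominates.
            peak = contrast.max()
            if peak > 0:
                contrast = contrast / peak
            saliency += contrast
    return saliency / (n_features * len(scales))


def salient_frames(saliency, threshold=0.5):
    """Indices of local maxima of the saliency curve above `threshold`."""
    s = np.asarray(saliency)
    return [
        t
        for t in range(1, len(s) - 1)
        if s[t] >= threshold and s[t] > s[t - 1] and s[t] >= s[t + 1]
    ]


if __name__ == "__main__":
    burst = np.zeros(100)
    burst[50] = 1.0          # a brief event in one feature
    step = np.zeros(100)
    step[50:] = 1.0          # a sustained change in another feature
    curve = temporal_saliency(np.vstack([burst, step]))
    print(salient_frames(curve))
```

Because each frame is compared only with the frames before it, the curve depends on time alone, in line with the abstract's description of a map "defined only over time"; a spectrogram-as-image approach would instead compute contrast across both time and frequency.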


Bibliographic record

Source: Annual Conference on Information Sciences and Systems (CISS), 2012, pp. 1-6 (6 pages)
Authors: Kaya Emine Merve; Elhilali Mounya
Original format: PDF
Chinese Library Classification: Information and communication theory

Similar literature

1. Mechanisms for allocating auditory attention: an auditory saliency map [J]. Kayser C, Petkov CI, Lippert M. Current Biology, 2005, no. 21.
2. A computational model of visual attention based on saliency maps [J]. Hang Shi, Yu Yang. Applied Mathematics and Computation, 2007, no. 2.
3. Visual Attention Modeling in Compressed Domain: From Image Saliency Detection to Video Saliency Detection [J]. FANG Yuming, ZHANG Xiaoqiang. ZTE Communications (English edition), 2019, no. 001.
4. A temporal saliency map for modeling auditory attention [C]. Kaya Emine Merve, Elhilali Mounya. Annual Conference on Information Sciences and Systems, 2012.
5. Estimation of the Temporal Response Function and Tracking Selective Auditory Attention Using Deep Kalman Filter [D]. Cao, Yexin. 2020.
6. The ups and downs of temporal orienting: a review of auditory temporal orienting studies and a model associating the heterogeneous findings on the auditory N1 with opposite effects of attention and prediction [O]. Kathrin Lange. 2013.
7. Mechanisms for Allocating Auditory Attention: An Auditory Saliency Map [O]. Kayser Christoph, Petkov Christopher I., Lippert Michael. 2005.
