A temporal saliency map for modeling auditory attention

机译：用于建模听觉注意的时间显着图

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The auditory system is flooded with information throughout our daily lives. Rather than processing all of this information, we selectively shift our attention to various auditory events - either events of interest (top-down attention) or events that capture our attention exogenously (bottom-up). In this work, we are concerned with aspects of human attention that are bottom-up stimulus-driven. Saliency of an auditory event is measured by how much the event differs from the surrounding sounds that precede it in time. To calculate this, we propose a novel auditory saliency map that is defined only over time. The proposed model is contrasted against previously published auditory saliency maps which treat the two-dimensional auditory time-frequency spectrogram as an image that can be analyzed using visual saliency models. Instead, our proposed model capitalizes on the rich high-dimensional feature space that defines auditory events; where each acoustic dimension is processed across multiple scales. These normalized feature maps are then combined over time into a single temporal saliency map. The peaks of the temporal saliency map indicate the locations of the salient events in the auditory scene. We validate the accuracy of the proposed model in simulated test scenarios of simple and complex sound clips. By exploiting the unique aspects of auditory processing that cannot be readily captured by visual processes, we are able to outperform other auditory saliency models; all while highlighting the commonalities and differences between the two modalities in processing salient events in everyday scenes.

机译：在我们的日常生活中，听觉系统充斥着信息。我们而不是处理所有这些信息，我们选择性地将我们的注意力转移到各种听觉事件 - 感兴趣的事件（自上而下的注意）或捕捉我们引起的事件（自下而上）。在这项工作中，我们关注人类注意的方面，即自下而上刺激驱动。听觉事件的显着性是通过与周围的声音不同的情况来衡量的。要计算出来，我们提出了一种仅限时间定义的新型听觉显着性图。所提出的模型与先前公布的听觉显着性图形成鲜明对比，其将二维听觉时频谱图视为可以使用视觉显着模型分析的图像。相反，我们的拟议模型大写了定义听觉事件的丰富的高维特征空间;在多个尺度上处理每个声尺寸的地方。然后将这些归一化特征贴图随时间结合到单个时间显着图中。时间显着图的峰值表示听觉场景中的突出事件的位置。我们验证了简单和复杂的声音剪辑的模拟测试场景中提出模型的准确性。通过利用可通过视觉过程易于捕获的听觉处理的独特方面，我们能够优于其他听效刻;虽然突出了在日常场景中处理突出事件中的两个方式之间的共同点和差异。

著录项

来源
《Annual Conference on Information Sciences and Systems》|2012年||共6页
会议地点
作者
Kaya Emine Merve; Elhilali Mounya;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G20-53;
关键词

相似文献

外文文献
中文文献
专利

1. 用于跨库语音情感识别的时频原子听觉注意模型 [J] . 张昕然, 宋鹏, 查诚, 东南大学学报（英文版） . 2016,第004期
2. Mechanisms for allocating auditory attention: an auditory saliency map [J] . Kayser C, Petkov CI, Lippert M, Current Biology: CB . 2005,第21期

机译：分配听觉注意力的机制：听觉显着图
3. A computational model of visual attention based on saliency maps [J] . Hang Shi, Yu Yang Applied mathematics and computation . 2007,第2期

机译：基于显着性图的视觉注意力计算模型
4. Visual Attention Modeling in Compressed Domain:From Image Saliency Detection to Video Saliency Detection [J] . FANG Yuming, ZHANG Xiaoqiang 中兴通讯技术（英文版） . 2019,第001期

机译：压缩域中的视觉注意力建模：从图像显着性检测到视频显着性检测
5. A temporal saliency map for modeling auditory attention [C] . Kaya Emine Merve, Elhilali Mounya Annual Conference on Information Sciences and Systems;CISS . 2012

机译：用于建模听觉注意力的时间显着图
6. Estimation of the Temporal Response Function and Tracking Selective Auditory Attention Using Deep Kalman Filter [D] . Cao, Yexin. 2020

机译：使用Deep Kalman滤波器估计时间响应函数和跟踪选择性听觉注意力
7. The ups and downs of temporal orienting: a review of auditory temporal orienting studies and a model associating the heterogeneous findings on the auditory N1 with opposite effects of attention and prediction [O] . Kathrin Lange 2013

机译：时间取向的起伏：听觉时间取向研究的回顾和将听觉N1异质发现与注意力和预测的相反影响相关联的模型
8. Mechanisms for Allocating Auditory Attention: An Auditory Saliency Map [O] . Kayser Christoph, Petkov Christopher I., Lippert Michael, 2005

机译：听觉注意力分配机制：听觉显着图

A temporal saliency map for modeling auditory attention

摘要

著录项

相似文献

相关主题

期刊订阅