Auditory Sketches: Sparse Representations of Sounds Based on Perceptual Models

机译：听觉草图：基于感知模型的声音稀疏表示

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An important question for both signal processing and auditory science is to understand which features of a sound carry the most important information for the listener. Here we approach the issue by introducing the idea of "auditory sketches": sparse representations of sounds, severely impoverished compared to the original, which nevertheless afford good performance on a given perceptual task. Starting from biologically-grounded representations (auditory models), a sketch is obtained by reconstructing a highly under-sampled selection of elementary atoms. Then, the sketch is evaluated with a psychophysical experiment involving human listeners. The process can be repeated iteratively. As a proof of concept, we present data for an emotion recognition task with short non-verbal sounds. We investigate 1/ the type of auditory representation that can be used for sketches 2/ the selection procedure to sparsify such representations 3/ the smallest number of atoms that can be kept 4/ the robustness to noise. Results indicate that it is possible to produce recognizable sketches with a very small number of atoms per second. Furthermore, at least in our experimental setup, a simple and fast under-sampling method based on selecting local maxima of the representation seems to perform as well or better than a more traditional algorithm aimed at minimizing the reconstruction error. Thus, auditory sketches may be a useful tool for choosing sparse dictionaries, and also for identifying the minimal set of features required in a specific perceptual task.

机译：信号处理和听觉科学的一个重要问题是要了解声音的哪个功能为侦听器提供最重要的信息。在这里，我们通过引入“听觉草图”的想法来解决问题：声音的稀疏表示，与原件相比严重贫困，但在给定的感知任务上提供了良好的表现。从生物接地的表示（听觉模型）开始，通过重建高度取样的基本原子选择来获得草图。然后，用涉及人类听众的心理物理实验评估草图。可以迭代地重复该过程。作为概念证明，我们为情感识别任务提供了短的非口头声音的数据。我们调查1 /可用于草图2 /选择过程的听觉表示的类型，以缩小这种表示的3 /最小的原子数，可以将4 /噪声稳健。结果表明，可以产生识别的草图，每秒具有非常少量的原子。此外，至少在我们的实验设置中，基于选择局部最大值的简单且快速的下式采样方法似乎也表现不佳或更好，而不是更传统的算法，该算法旨在最小化重建误差。因此，听觉草图可以是用于选择稀疏词典的有用工具，并且还用于识别特定感知任务所需的最小特征集。

著录项

来源
《International Symposium on Computer Music Modeling and Retrieval》|2013年||共17页
会议地点
作者
Clara Suied; Angelique Dremeau; Daniel Pressnitzer; Laurent Daudet;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP242.63-53;
关键词

相似文献

外文文献
中文文献
专利

1. Sound Retrieval and Ranking Using Sparse Auditory Representations [J] . Richard F. Lyon, Martin Rehn, Samy Bengio, Neural computation . 2010,第9期

机译：使用稀疏听觉表示进行声音检索和排序
2. Sparse Representation of Sounds in the Unanesthetized Auditory Cortex [J] . Tomá? Hromádka, Michael R DeWeese, Anthony M Zador PLoS Biology . 2008,第1期

机译：未麻醉的听觉皮层中声音的稀疏表示
3. Neural Representation of Concurrent Harmonic Sounds in Monkey Primary Auditory Cortex: Implications for Models of Auditory Scene Analysis [J] . Fishman Yonatan I., Steinschneider Mitchell, Micheyl Christophe The Journal of Neuroscience: The Official Journal of the Society for Neuroscience . 2014,第37期

机译：猴子初级听觉皮层并发谐波的神经表征：对听觉场景分析模型的启示
4. Auditory Sketches: Sparse Representations of Sounds Based on Perceptual Models [C] . Clara Suied, Angelique Dremeau, Daniel Pressnitzer, International Symposium on Computer Music Modeling and Retrieval . 2013

机译：听觉草图：基于感知模型的声音稀疏表示
5. Multi-color fluorescence in-situ hybridization (M-FISH) image analysis based on sparse representation models. [D] . Li, Jingyao. 2015

机译：基于稀疏表示模型的多色荧光原位杂交（M-FISH）图像分析。
6. Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable [O] . Vincent Isnard, Marine Taffou, Isabelle Viaud-Delmon, -1

机译：听觉草图：声音的非常稀疏表示仍然可以识别
7. Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable [O] . Isnard, Vincent, Taffou, Marine, Viaud-Delmon, Isabelle, 2016

机译：听觉草图：声音的非常稀疏表示仍然可以识别

Auditory Sketches: Sparse Representations of Sounds Based on Perceptual Models

摘要

著录项

相似文献

相关主题

期刊订阅