Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning

Yu Qiang; Yao Yanli; Wang Longbiao; Tang Huajin; Dang Jianwu; Tan Kay Chen

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning

【24h】

Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning

机译：强大的环境声音识别与稀疏关键点编码和高效的多分层学习

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The capability for environmental sound recognition (ESR) can determine the fitness of individuals in a way to avoid dangers or pursue opportunities when critical sound events occur. It still remains mysterious about the fundamental principles of biological systems that result in such a remarkable ability. Additionally, the practical importance of ESR has attracted an increasing amount of research attention, but the chaotic and nonstationary difficulties continue to make it a challenging task. In this article, we propose a spike-based framework from a more brain-like perspective for the ESR task. Our framework is a unifying system with consistent integration of three major functional parts which are sparse encoding, efficient learning, and robust readout. We first introduce a simple sparse encoding, where key points are used for feature representation, and demonstrate its generalization to both spike- and nonspike-based systems. Then, we evaluate the learning properties of different learning rules in detail with our contributions being added for improvements. Our results highlight the advantages of multispike learning, providing a selection reference for various spike-based developments. Finally, we combine the multispike readout with the other parts to form a system for ESR. Experimental results show that our framework performs the best as compared to other baseline approaches. In addition, we show that our spike-based framework has several advantageous characteristics including early decision making, small dataset acquiring, and ongoing dynamic processing. Our framework is the first attempt to apply the multispike characteristic of nervous neurons to ESR. The outstanding performance of our approach would potentially contribute to draw more research efforts to push the boundaries of spike-based paradigm to a new horizon.

机译：环境声音识别（ESR）的能力可以以一种方式确定个人的适应性，以避免危险或追求批判声音事件的危险。它仍然是关于生物系统的基本原则的神秘，导致这种显着的能力。此外，ESR的实际重要性引起了越来越多的研究关注，但混乱和非间断的困难继续使其成为一个具有挑战性的任务。在本文中，我们向ESR任务的更脑的角度提出了一种基于峰值的框架。我们的框架是一个统一系统，具有一致的三个主要功能部件，这是稀疏编码，高效学习和恢复读出的主要功能部件。我们首先介绍一个简单的稀疏编码，其中关键点用于特征表示，并展示其对基于尖峰和基于峰值的系统的概括。然后，我们详细评估了不同学习规则的学习属性，我们为改进添加了我们的贡献。我们的结果突出了多点学习的优势，为各种基于尖峰的发展提供选择参考。最后，我们将多点读数与其他部分相结合以形成ESR的系统。实验结果表明，与其他基线方法相比，我们的框架表现了最佳。此外，我们表明我们的峰值框架具有几个有利的特征，包括早期决策，小型数据集获取和正在进行的动态处理。我们的框架是第一次尝试将神经神经元的多点特征应用于ESR。我们的方法的突出表现可能会有助于提高更多的研究努力，将基于峰值范式的界限推向一个新的地平线。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on》 |2021年第2期|625-638|共14页
作者
Yu Qiang; Yao Yanli; Wang Longbiao; Tang Huajin; Dang Jianwu; Tan Kay Chen;
展开▼
作者单位

Tianjin Univ Coll Intelligence & Comp Tianjin Key Lab Cognit Comp & Applicat Tianjin 300350 Peoples R China;

Tianjin Univ Coll Intelligence & Comp Tianjin Key Lab Cognit Comp & Applicat Tianjin 300350 Peoples R China;

Tianjin Univ Coll Intelligence & Comp Tianjin Key Lab Cognit Comp & Applicat Tianjin 300350 Peoples R China;

Zhejiang Univ Coll Comp Sci & Technol Hangzhou 610065 Peoples R China;

Tianjin Univ Coll Intelligence & Comp Tianjin Key Lab Cognit Comp & Applicat Tianjin 300350 Peoples R China;

City Univ Hong Kong Dept Comp Sci Hong Kong Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Encoding; Task analysis; Hidden Markov models; Neurons; Biological neural networks; Mel frequency cepstral coefficient; Biological information theory; Brain-like processing; feature extraction; multispike learning; neuromorphic computing; robust sound recognition; spike encoding; spiking neural networks (SNNs);

机译：编码;任务分析;隐马尔可夫模型;神经元;生物学神经网络;生物信息理论;脑状的加工;特征提取;多点学习;神经形态计算;稳健的声音识别;尖峰编码;尖峰编码;飙升神经网络（SNNS ）;

相似文献

外文文献
中文文献
专利

1. Spike-based encoding and learning of spectrum features for robust sound recognition [J] . Xiao Rong, Tang Huajin, Gu Pengjie, Neurocomputing . 2018,第NOVa3期

机译：基于峰值的编码和频谱特征学习，可实现可靠的声音识别
2. Efficient Local Feature Encoding for Human Action Recognition with Approximate Sparse Coding [J] . Yu WANG, Jien KATO IEICE transactions on information and systems . 2016,第4期

机译：近似稀疏编码的有效人类行为识别局部特征编码
3. Block Sparse Bayesian Learning over Local Dictionary for Robust SAR Target Recognition [J] . Chenyu Li, Guohua Liu International Journal of Optics . 2020,第3期

机译：阻止稀疏贝叶斯在局部词典中学习强大的SAR目标识别
4. Learning weighted sparse representation of encoded facial normal information for expression-robust 3D face recognition [C] . Huibin Li, Di Huang, Morvan Jean-Marie, 2011 International Joint Conference on Biometrics . 2011

机译：学习编码的面部正常信息的加权稀疏表示以实现鲁棒的3D人脸识别
5. Sparse Methods for Robust and Efficient Visual Recognition. [D] . Shekhar, Sumit. 2014

机译：鲁棒高效的视觉识别的稀疏方法。
6. Robust Single-Sample Face Recognition by Sparsity-Driven Sub-Dictionary Learning Using Deep Features [O] . Vittorio Cuculo, Alessandro D’Amelio, Giuliano Grossi, 2019

机译：通过深度特征稀疏驱动的子字典学习进行稳健的单样本人脸识别
7. On robust face recognition via sparse encoding : the good, the bad, and the ugly [O] . Wong Yongkang, Harandi Mehrtash T., Sanderson Conrad 2013

机译：通过稀疏编码进行稳健的人脸识别：好，坏和丑陋
8. Some methods of encoding simple visual images for use with a sparse distributed memory, with applications to character recognition [R] . Jaeckel, Louis A. 1989

机译：一些编码简单视觉图像的方法，用于稀疏分布式存储器，应用于字符识别

Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅