首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning
【24h】

Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning

机译:强大的环境声音识别与稀疏关键点编码和高效的多分层学习

获取原文
获取原文并翻译 | 示例
           

摘要

The capability for environmental sound recognition (ESR) can determine the fitness of individuals in a way to avoid dangers or pursue opportunities when critical sound events occur. It still remains mysterious about the fundamental principles of biological systems that result in such a remarkable ability. Additionally, the practical importance of ESR has attracted an increasing amount of research attention, but the chaotic and nonstationary difficulties continue to make it a challenging task. In this article, we propose a spike-based framework from a more brain-like perspective for the ESR task. Our framework is a unifying system with consistent integration of three major functional parts which are sparse encoding, efficient learning, and robust readout. We first introduce a simple sparse encoding, where key points are used for feature representation, and demonstrate its generalization to both spike- and nonspike-based systems. Then, we evaluate the learning properties of different learning rules in detail with our contributions being added for improvements. Our results highlight the advantages of multispike learning, providing a selection reference for various spike-based developments. Finally, we combine the multispike readout with the other parts to form a system for ESR. Experimental results show that our framework performs the best as compared to other baseline approaches. In addition, we show that our spike-based framework has several advantageous characteristics including early decision making, small dataset acquiring, and ongoing dynamic processing. Our framework is the first attempt to apply the multispike characteristic of nervous neurons to ESR. The outstanding performance of our approach would potentially contribute to draw more research efforts to push the boundaries of spike-based paradigm to a new horizon.
机译:环境声音识别(ESR)的能力可以以一种方式确定个人的适应性,以避免危险或追求批判声音事件的危险。它仍然是关于生物系统的基本原则的神秘,导致这种显着的能力。此外,ESR的实际重要性引起了越来越多的研究关注,但混乱和非间断的困难继续使其成为一个具有挑战性的任务。在本文中,我们向ESR任务的更脑的角度提出了一种基于峰值的框架。我们的框架是一个统一系统,具有一致的三个主要功能部件,这是稀疏编码,高效学习和恢复读出的主要功能部件。我们首先介绍一个简单的稀疏编码,其中关键点用于特征表示,并展示其对基于尖峰和基于峰值的系统的概括。然后,我们详细评估了不同学习规则的学习属性,我们为改进添加了我们的贡献。我们的结果突出了多点学习的优势,为各种基于尖峰的发展提供选择参考。最后,我们将多点读数与其他部分相结合以形成ESR的系统。实验结果表明,与其他基线方法相比,我们的框架表现了最佳。此外,我们表明我们的峰值框架具有几个有利的特征,包括早期决策,小型数据集获取和正在进行的动态处理。我们的框架是第一次尝试将神经神经元的多点特征应用于ESR。我们的方法的突出表现可能会有助于提高更多的研究努力,将基于峰值范式的界限推向一个新的地平线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号