首页> 美国卫生研究院文献>other >Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network
【2h】

Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network

机译:卷积加标神经网络中通过依赖于时序的可塑性实现无监督语音识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Speech recognition (SR) has been improved significantly by artificial neural networks (ANNs), but ANNs have the drawbacks of biologically implausibility and excessive power consumption because of the nonlocal transfer of real-valued errors and weights. While spiking neural networks (SNNs) have the potential to solve these drawbacks of ANNs due to their efficient spike communication and their natural way to utilize kinds of synaptic plasticity rules found in brain for weight modification. However, existing SNN models for SR either had bad performance, or were trained in biologically implausible ways. In this paper, we present a biologically inspired convolutional SNN model for SR. The network adopts the time-to-first-spike coding scheme for fast and efficient information processing. A biological learning rule, spike-timing-dependent plasticity (STDP), is used to adjust the synaptic weights of convolutional neurons to form receptive fields in an unsupervised way. In the convolutional structure, the strategy of local weight sharing is introduced and could lead to better feature extraction of speech signals than global weight sharing. We first evaluated the SNN model with a linear support vector machine (SVM) on the TIDIGITS dataset and it got the performance of 97.5%, comparable to the best results of ANNs. Deep analysis on network outputs showed that, not only are the output data more linearly separable, but they also have fewer dimensions and become sparse. To further confirm the validity of our model, we trained it on a more difficult recognition task based on the TIMIT dataset, and it got a high performance of 93.8%. Moreover, a linear spike-based classifier—tempotron—can also achieve high accuracies very close to that of SVM on both the two tasks. These demonstrate that an STDP-based convolutional SNN model equipped with local weight sharing and temporal coding is capable of solving the SR task accurately and efficiently.
机译:人工神经网络(ANN)极大地改善了语音识别(SR),但由于实值误差和权重的非本地传递,人工神经网络具有生物学上的不可靠性和功耗过大的缺点。虽然尖峰神经网络(SNN)可以解决ANN的这些弊端,这是由于其有效的尖峰交流以及利用大脑中发现的各种突触可塑性规则进行体重调节的自然方式。但是,现有的SR SNN模型要么性能不佳,要么以生物学上难以置信的方式进行了训练。在本文中,我们提出了一种生物学启发的卷积SNN模型。该网络采用时间到峰值时间编码方案,以进行快速有效的信息处理。一种生物学学习规则,即依赖于时序定时的可塑性(STDP),用于调节卷积神经元的突触权重以无监督的方式形成感受野。在卷积结构中,引入了局部权重共享策略,与全局权重共享相比,它可以导致语音信号的特征提取更好。我们首先在TIDIGITS数据集上使用线性支持向量机(SVM)评估了SNN模型,其性能为97.5%,可与ANN的最佳结果相媲美。对网络输出的深入分析表明,不仅输出数据在线性上更可分离,而且它们的维数更少并且变得稀疏。为了进一步确认我们模型的有效性,我们在TIMIT数据集的基础上对它进行了更困难的识别任务训练,并获得了93.8%的高性能。此外,在两个任务上,基于线性峰值的分类器-Tempotron-也可以实现非常接近SVM的高精度。这些证明基于STDP的卷积SNN模型配备了局部权重共享和时间编码,能够准确有效地解决SR任务。

著录项

  • 期刊名称 other
  • 作者

    Meng Dong; Xuhui Huang; Bo Xu;

  • 作者单位
  • 年(卷),期 -1(13),11
  • 年度 -1
  • 页码 e0204596
  • 总页数 19
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号