首页> 美国卫生研究院文献>other >Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network

【2h】

Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network

机译：卷积加标神经网络中通过依赖于时序的可塑性实现无监督语音识别

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech recognition (SR) has been improved significantly by artificial neural networks (ANNs), but ANNs have the drawbacks of biologically implausibility and excessive power consumption because of the nonlocal transfer of real-valued errors and weights. While spiking neural networks (SNNs) have the potential to solve these drawbacks of ANNs due to their efficient spike communication and their natural way to utilize kinds of synaptic plasticity rules found in brain for weight modification. However, existing SNN models for SR either had bad performance, or were trained in biologically implausible ways. In this paper, we present a biologically inspired convolutional SNN model for SR. The network adopts the time-to-first-spike coding scheme for fast and efficient information processing. A biological learning rule, spike-timing-dependent plasticity (STDP), is used to adjust the synaptic weights of convolutional neurons to form receptive fields in an unsupervised way. In the convolutional structure, the strategy of local weight sharing is introduced and could lead to better feature extraction of speech signals than global weight sharing. We first evaluated the SNN model with a linear support vector machine (SVM) on the TIDIGITS dataset and it got the performance of 97.5%, comparable to the best results of ANNs. Deep analysis on network outputs showed that, not only are the output data more linearly separable, but they also have fewer dimensions and become sparse. To further confirm the validity of our model, we trained it on a more difficult recognition task based on the TIMIT dataset, and it got a high performance of 93.8%. Moreover, a linear spike-based classifier—tempotron—can also achieve high accuracies very close to that of SVM on both the two tasks. These demonstrate that an STDP-based convolutional SNN model equipped with local weight sharing and temporal coding is capable of solving the SR task accurately and efficiently.

机译：人工神经网络（ANN）极大地改善了语音识别（SR），但由于实值误差和权重的非本地传递，人工神经网络具有生物学上的不可靠性和功耗过大的缺点。虽然尖峰神经网络（SNN）可以解决ANN的这些弊端，这是由于其有效的尖峰交流以及利用大脑中发现的各种突触可塑性规则进行体重调节的自然方式。但是，现有的SR SNN模型要么性能不佳，要么以生物学上难以置信的方式进行了训练。在本文中，我们提出了一种生物学启发的卷积SNN模型。该网络采用时间到峰值时间编码方案，以进行快速有效的信息处理。一种生物学学习规则，即依赖于时序定时的可塑性（STDP），用于调节卷积神经元的突触权重以无监督的方式形成感受野。在卷积结构中，引入了局部权重共享策略，与全局权重共享相比，它可以导致语音信号的特征提取更好。我们首先在TIDIGITS数据集上使用线性支持向量机（SVM）评估了SNN模型，其性能为97.5％，可与ANN的最佳结果相媲美。对网络输出的深入分析表明，不仅输出数据在线性上更可分离，而且它们的维数更少并且变得稀疏。为了进一步确认我们模型的有效性，我们在TIMIT数据集的基础上对它进行了更困难的识别任务训练，并获得了93.8％的高性能。此外，在两个任务上，基于线性峰值的分类器-Tempotron-也可以实现非常接近SVM的高精度。这些证明基于STDP的卷积SNN模型配备了局部权重共享和时间编码，能够准确有效地解决SR任务。

著录项

期刊名称 other
作者
Meng Dong; Xuhui Huang; Bo Xu;
展开▼
作者单位

展开▼
年(卷),期 -1(13),11
年度 -1
页码 e0204596
总页数 19
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Deep Spiking Convolutional Neural Network Trained With Unsupervised Spike-Timing-Dependent Plasticity [J] . Lee Chankyu, Srinivasan Gopalakrishnan, Panda Priyadarshini, IEEE Transactions on Cognitive and Developmental Systems . 2019,第3期

机译：深度峰值卷积神经网络的无监督峰值计时依赖可塑性训练
2. Deep Spiking Convolutional Neural Network Trained With Unsupervised Spike-Timing-Dependent Plasticity [J] . Lee Chankyu, Srinivasan Gopalakrishnan, Panda Priyadarshini, IEEE Transactions on Cognitive and Developmental Systems . 2019,第3期

机译：深度尖峰卷积神经网络训练，具有无监督的峰值定时依赖性可塑性
3. Bio-inspired digit recognition using reward-modulated spike-timing-dependent plasticity in deep convolutional networks [J] . Mozafari Milad, Ganjtabesh Mohammad, Nowzari-Dalini Abbas, Pattern Recognition: The Journal of the Pattern Recognition Society . 2019,第期

机译：生物启发数字识别在深卷积网络中使用奖励调制的尖峰定时依赖性可塑性
4. A Spiking Neural Network with a Global Self-Controller for Unsupervised Learning Based on Spike-Timing-Dependent Plasticity Using Flash Memory Synaptic Devices [C] . Won-Mook Kang, Chul-Heung Kim, Soochang Lee, International Joint Conference on Neural Networks . 2019

机译：基于尖峰计时相关可塑性的具有闪存自控装置的具有全局自控制器的尖峰神经网络用于无监督学习
5. Convolutional Neural Networks for Speaker-Independent Speech Recognition. [D] . Belilovsky, Eugene. 2011

机译：用于与说话人无关的语音识别的卷积神经网络。
6. Spike-Timing-Dependent Plasticity and Short-Term Plasticity Jointly Control the Excitation of Hebbian Plasticity withoutWeight Constraints in Neural Networks [O] . Subha Fernando, Koichi Yamada 2012

机译：不依赖尖峰时间的可塑性和短期可塑性共同控制Hebbian可塑性的激发神经网络中的权重约束
7. Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network [O] . Meng Dong, Xuhui Huang, Bo Xu 2018

机译：通过卷积尖峰神经网络中的峰时依赖性可塑性无监督的语音识别

Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network

摘要

著录项

相似文献

相关主题

期刊订阅