Biologically-Inspired Spike-Based Automatic Speech Recognition of Isolated Digits Over a Reproducing Kernel Hilbert Space

Abstract

This paper presents a novel real-time dynamic framework for quantifying time-series structure in spoken words using spikes. Audio signals are converted into multi-channel spike trains using a biologically-inspired leaky integrate-and-fire (LIF) spike generator. These spike trains are mapped into a function space of infinite dimension, i.e., a reproducing kernel Hilbert space (RKHS), using point-process kernels, where a state-space model learns the dynamics of the multidimensional spike input using gradient descent learning. This kernelized recurrent system is very parsimonious and achieves the necessary memory depth via feedback of its internal states when trained discriminatively, utilizing the full context of the phoneme sequence. A main advantage of modeling nonlinear dynamics using state-space trajectories in the RKHS is that it imposes no restriction on the relationship between the exogenous input and its internal state. We are free to choose the input representation with an appropriate kernel, and changing the kernel affects neither the system nor the learning algorithm. Moreover, we show that this novel framework can outperform both traditional hidden Markov model (HMM) speech processing and neuromorphic implementations based on spiking neural networks (SNNs), yielding accurate and ultra-low power word spotters. As a proof of concept, we demonstrate its capabilities using the benchmark TI-46 digit corpus for isolated-word automatic speech recognition (ASR) or keyword spotting. Compared to an HMM using a Mel-frequency cepstral coefficient (MFCC) front end without time derivatives, our MFCC-KAARMA (kernel adaptive autoregressive-moving-average) offered improved performance. With a spike-train front end, spike-KAARMA also outperformed state-of-the-art SNN solutions. Furthermore, compared to MFCCs, spike trains provided enhanced noise robustness in certain low signal-to-noise ratio (SNR) regimes.
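The first two stages described in the abstract (LIF spike generation, then a point-process kernel that compares spike trains) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names `lif_encode` and `mci_kernel`, every parameter value, and the choice of the memoryless cross-intensity kernel with exponential smoothing are assumptions made for illustration only; the abstract does not state which point-process kernel or which LIF parameters were used. The input is assumed to be a single channel that has already been band-pass filtered.

```python
import numpy as np


def lif_encode(signal, fs, tau=0.01, threshold=1.0, gain=200.0, refractory=0.002):
    """Encode one channel as spike times with a leaky integrate-and-fire unit.

    All parameter defaults are illustrative assumptions, not values from the paper.
    """
    dt = 1.0 / fs
    v = 0.0                  # membrane potential
    last_spike = -np.inf     # time of the most recent spike
    spikes = []
    for n, x in enumerate(signal):
        t = n * dt
        if t - last_spike < refractory:
            continue         # potential stays at reset during the refractory period
        drive = max(float(x), 0.0)            # half-wave rectify the input
        # Forward-Euler step of dv/dt = -v / tau + gain * drive
        v += dt * (-v / tau + gain * drive)
        if v >= threshold:
            spikes.append(t)
            v = 0.0          # reset after firing
            last_spike = t
    return np.array(spikes)


def mci_kernel(spikes_a, spikes_b, tau=0.005):
    """Memoryless cross-intensity kernel with exponential (Laplacian) smoothing.

    Sums exp(-|t_a - t_b| / tau) over every pair of spike times.
    """
    spikes_a = np.asarray(spikes_a, dtype=float)
    spikes_b = np.asarray(spikes_b, dtype=float)
    if spikes_a.size == 0 or spikes_b.size == 0:
        return 0.0
    diffs = np.abs(spikes_a[:, None] - spikes_b[None, :])
    return float(np.exp(-diffs / tau).sum())


if __name__ == "__main__":
    fs = 16000
    t = np.arange(int(0.1 * fs)) / fs
    strong = lif_encode(1.0 + 0.5 * np.sin(2 * np.pi * 50 * t), fs)
    weak = lif_encode(0.7 + 0.1 * np.sin(2 * np.pi * 50 * t), fs)
    print("spike counts:", len(strong), len(weak))
    print("kernel value:", mci_kernel(strong, weak))
```

For a multi-channel front end, one simple option is to sum the per-channel kernel values, since a sum of positive-definite kernels is itself positive definite. The kernel value would then serve as the inner product that a kernelized state-space model operates on.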

Bibliographic details

Journal: Frontiers in Neuroscience
Authors: Kan Li; José C. Príncipe
Year (volume), issue: 2018(12), -1
Year: 2018
Page: 194
Total pages: 17
Original format: PDF
Classification (Chinese Library Classification): Neuroscience
Keywords: spike-based learning; noise-robust automatic speech recognition (ASR); keyword spotting; kernel adaptive filtering (KAF); reproducing kernel Hilbert space (RKHS); kernel method; neuromorphic computation