X1000 real-time phoneme recognition VLSI using feed-forward deep neural networks

机译：使用前馈深度神经网络的X1000实时音素识别VLSI

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep neural networks show very good performance in phoneme and speech recognition applications when compared to previously used GMM (Gaussian Mixture Model)-based ones. However, efficient implementation of deep neural networks is difficult because the network size needs to be very large when high recognition accuracy is demanded. In this work, we develop a digital VLSI for phoneme recognition using deep neural networks and assess the design in terms of throughput, chip size, and power consumption. The developed VLSI employs a fixed-point optimization method that only uses +Δ, 0, and −Δ for representing each of the weight. The design employs 1,024 simple processing units in each layer, which however can be scaled easily according to the needed throughput, and the throughput of the architecture varies from 62.5 to 1,000 times of the real-time processing speed.

机译：与以前使用的基于GMM（高斯混合模型）的神经网络相比，深度神经网络在音素和语音识别应用中显示出非常好的性能。但是，由于需要高识别精度时网络规模非常大，因此深度神经网络的有效实现非常困难。在这项工作中，我们开发了用于使用深度神经网络进行音素识别的数字VLSI，并在吞吐量，芯片尺寸和功耗方面评估了设计。所开发的VLSI采用仅使用+Δ，0和-Δ表示每个权重的定点优化方法。该设计在每层中使用1,024个简单处理单元，但是可以根据所需的吞吐量轻松进行缩放，并且体系结构的吞吐量从实时处理速度的62.5倍到1,000倍不等。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2014年|7510-7514|共5页
会议地点
作者
Kim Jonghong; Hwang Kyuyeon; Sung Wonyong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Deep neural network; VLSI; fixed-point optimization; phoneme recognition;

机译：深度神经网络超大规模集成电路定点优化;音素识别;

相似文献

外文文献
中文文献
专利

1. A spiking neural network for real-time Spanish vowel phonemes recognition [J] . Miro-Amarante L., Gomez-Rodriguez F., Jimenez-Fernandez A., Neurocomputing . 2017,第FEBa22期

机译：尖峰神经网络，用于实时西班牙语元音音素识别
2. Real-time license plate detection and recognition using deep convolutional neural networks [J] . Silva Sergio Montazzolli, Jung Claudio Rosito Journal of visual communication & image representation . 2020,第Auga期

机译：使用深卷积神经网络的实时车牌检测和识别
3. AN INTRINSIC REAL-TIME MULTIMODAL RECOGNITION SYSTEM USING DEEP NEURAL NETWORKS [J] . J.SEETHALAKSHMI, C.JAYAKUMAR Journal of Theoretical and Applied Information Technology . 2016,第2期

机译：基于深层神经网络的内在实时多模态识别系统
4. X1000 REAL-TIME PHONEME RECOGNITION VLSI USING FEED-FORWARD DEEP NEURAL NETWORKS [C] . Jonghong Kim, Kyuyeon Hwang, Wonyong Sung IEEE International Conference on Acoustics, Speech and Signal Processing . 2014

机译：X1000使用前馈深神经网络的实时音素识别VLSI
5. Mass spectral pattern recognition by feed-forward neural network filtering. [D] . Thomas, Thomas Geiger, Jr. 1997

机译：通过前馈神经网络滤波进行质谱图模式识别。
6. EmotionNet Nano: An Efficient Deep Convolutional Neural Network Design for Real-Time Facial Expression Recognition [O] . James Ren Lee, Linda Wang, Alexander Wong 2020

机译：Emotionnet Nano：实时面部表情识别的有效深度卷积神经网络设计
7. System for characterisation and recognition of Arabic phonemes among Malaysian children using feed-forward neural networks [O] . Abdul Kadir Nurul Ashikin 2012

机译：使用前馈神经网络表征和识别马来西亚儿童中阿拉伯音素的系统

X1000 real-time phoneme recognition VLSI using feed-forward deep neural networks

摘要

著录项

相似文献

相关主题

期刊订阅