IEEE Transactions on Very Large Scale Integration (VLSI) Systems
A Generic and Scalable Architecture for a Large Acoustic Model and Large Vocabulary Speech Recognition Accelerator Using Logic on Memory


Abstract

This paper describes a scalable hardware accelerator for speech recognition, which uses a two-pass decoding algorithm with word-dependent N-best Viterbi beam search. The observation probability calculation (senone scoring) and the first decoding pass, which uses a bigram language model, are implemented in hardware. The word lattice output from the first pass is used by software for the second pass, which applies a trigram language model. The proposed design uses a logic-on-memory approach to exploit high-bandwidth NOR flash memory, improving random read performance for senone scoring and first-pass decoding, both of which are memory-intensive operations. The proposed HW/SW co-design achieves an overall speedup of 4.3X over a 2.4-GHz Intel Core 2 Duo processor running the CMU Sphinx speech recognition software, while consuming an estimated 1.72 W of power. The hardware accelerator improves speech recognition accuracy by supporting larger acoustic models and word dictionaries while maintaining real-time performance.
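To make the first-pass algorithm the abstract refers to concrete, the following is a minimal, self-contained sketch of beam-pruned decoding with a bigram language model. It is purely illustrative and is not the paper's hardware design: the vocabulary, bigram probabilities, the `acoustic_score` stand-in for senone scoring, and the beam width are all assumed toy values.

```python
import math

# Assumed toy bigram log-probabilities (illustration only, not from the paper).
BIGRAM = {
    ("<s>", "the"): math.log(0.6),
    ("<s>", "a"): math.log(0.4),
    ("the", "cat"): math.log(0.5),
    ("the", "dog"): math.log(0.5),
    ("a", "cat"): math.log(0.7),
    ("a", "dog"): math.log(0.3),
}
VOCAB = ["the", "a", "cat", "dog"]

def acoustic_score(word, t):
    # Stand-in for senone scoring: a fixed toy log-likelihood per frame.
    TABLE = [
        {"the": -1.0, "a": -2.0, "cat": -9.0, "dog": -9.0},
        {"the": -9.0, "a": -9.0, "cat": -1.5, "dog": -2.5},
    ]
    return TABLE[t][word]

def beam_decode(num_frames, beam_width=2):
    # Each hypothesis is (total log score, word sequence), starting at <s>.
    beam = [(0.0, ("<s>",))]
    for t in range(num_frames):
        candidates = []
        for score, seq in beam:
            for w in VOCAB:
                lm = BIGRAM.get((seq[-1], w))
                if lm is None:
                    continue  # skip bigrams absent from the model
                candidates.append((score + lm + acoustic_score(w, t),
                                   seq + (w,)))
        # Beam pruning: keep only the top-scoring hypotheses (the N-best).
        candidates.sort(key=lambda h: h[0], reverse=True)
        beam = candidates[:beam_width]
    return beam

if __name__ == "__main__":
    best_score, best_seq = beam_decode(num_frames=2)[0]
    print(best_seq[1:])  # best word sequence without the start symbol
```

In the accelerator described above, the equivalent of `acoustic_score` and this first pass run in hardware against NOR flash, and the surviving hypotheses form the word lattice handed to the software trigram pass.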
