...
首页> 外文期刊>Instrumentation and Measurement, IEEE Transactions on >An FPGA-Based Embedded Robust Speech Recognition System Designed by Combining Empirical Mode Decomposition and a Genetic Algorithm
【24h】

An FPGA-Based Embedded Robust Speech Recognition System Designed by Combining Empirical Mode Decomposition and a Genetic Algorithm

机译:结合经验模式分解和遗传算法设计的基于FPGA的嵌入式鲁棒语音识别系统

获取原文
获取原文并翻译 | 示例
           

摘要

A field-programmable gate array (FPGA)-based robust speech measurement and recognition system is the focus of this paper, and the environmental noise problem is its main concern. To accelerate the recognition speed of the FPGA-based speech recognition system, the discrete hidden Markov model is used here to lessen the computation burden inherent in speech recognition. Furthermore, the empirical mode decomposition is used to decompose the measured speech signal contaminated by noise into several intrinsic mode functions (IMFs). The IMFs are then weighted and summed to reconstruct the original clean speech signal. Unlike previous research, in which IMFs were selected by trial and error for specific applications, the weights for each IMF are designed by the genetic algorithm to obtain an optimal solution. The experimental results in this paper reveal that this method achieves a better speech recognition rate for speech subject to various environmental noises. Moreover, this paper also explores the hardware realization of the designed speech measurement and recognition systems on an FPGA-based embedded system with the System-On-a-Chip (SOC) architecture. Since the central-processing-unit core adopted in the SOC has limited computation ability, this paper uses the integer fast Fourier transform (FFT) to replace the floating-point FFT to speed up the computation for capturing speech features through a mel-frequency cepstrum coefficient. The result is a significant reduction in the calculation time without influencing the speech recognition rate. It can be seen from the experiments in this paper that the performance of the implemented hardware is significantly better than that of existing research.
机译:基于现场可编程门阵列(FPGA)的鲁棒语音测量和识别系统是本文的重点,而环境噪声问题是其主要关注点。为了加快基于FPGA的语音识别系统的识别速度,在此使用离散隐马尔可夫模型来减轻语音识别中固有的计算负担。此外,经验模式分解被用于将被噪声污染的测量语音信号分解为几个固有模式函数(IMF)。然后对IMF进行加权和求和,以重建原始的干净语音信号。不同于先前的研究,IMF是通过反复试验针对特定应用选择的,而每个IMF的权重都是通过遗传算法设计的,以获得最佳解决方案。实验结果表明,该方法对于各种环境噪声下的语音都具有较好的语音识别率。此外,本文还探讨了在具有片上系统(SOC)架构的基于FPGA的嵌入式系统上设计语音测量和识别系统的硬件实现。由于SOC中采用的中央处理单元核心的计算能力有限,因此本文使用整数快速傅立叶变换(FFT)代替浮点FFT,以加快通过mel频率倒谱捕获语音特征的计算系数。结果是显着减少了计算时间,而不会影响语音识别率。从本文的实验可以看出,所实现的硬件的性能明显优于现有研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号