首页> 外文会议>IEEE International Conference on Consumer Electronics - Berlin >Studying the Effects of Feature Extraction Settings on the Accuracy and Memory Requirements of Neural Networks for Keyword Spotting
【24h】

Studying the Effects of Feature Extraction Settings on the Accuracy and Memory Requirements of Neural Networks for Keyword Spotting

机译:研究特征提取设置对关键词识别神经网络精度和内存需求的影响

获取原文

摘要

Due to the always-on nature of keyword spotting (KWS) systems, low power consumption micro-controller units (MCU) are the best choices as deployment devices. However, small computation power and memory budget of MCUs can harm the accuracy requirements. Although, many studies have been conducted to design small memory footprint neural networks to address this problem, the effects of different feature extraction settings are rarely studied. This work addresses this important question by first, comparing six of the most popular and state of the art neural network architectures for KWS on the Google Speech-Commands dataset. Then, keeping the network architectures unchanged it performs comprehensive investigations on the effects of different frequency transformation settings, such as number of used mel-frequency cepstrum coefficients (MFCCs) and length of the stride window, on the accuracy and memory footprint (RAM/ROM) of the models. The results show different preprocessing settings can change the accuracy and RAM/ROM requirements significantly of the models. Furthermore, it is shown that DS-CNN outperforms the other architectures in terms of accuracy with a value of 93.47% with least amount of ROM requirements, while the GRU outperforms all other networks with an accuracy of 91.02% with smallest RAM requirements.
机译:由于关键字搜寻(KWS)系统始终处于打开状态,因此低功耗微控制器单元(MCU)是部署设备的最佳选择。但是,MCU的小的计算能力和内存预算可能会损害准确性要求。尽管已经进行了许多研究来设计小型内存占用神经网络来解决此问题,但是很少研究不同特征提取设置的影响。这项工作首先解决了这个重要问题,在Google Speech-Commands数据集上比较了KWS的六个最受欢迎和最先进的神经网络体系结构。然后,在保持网络体系结构不变的情况下,它对不同的频率转换设置(例如使用的mel频率倒谱系数(MFCC)的数量和步幅窗口的长度)对准确性和内存占用量(RAM / ROM)的影响进行了全面的研究。 )的模型。结果表明,不同的预处理设置可以显着改变模型的准确性和RAM / ROM要求。此外,表明DS-CNN在精度方面以93.47%的值优于其他体系结构,而对ROM的需求最少,而GRU在所有RAM方面的需求以91.02%的精度优于所有其他网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号