首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
【24h】

A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion

机译:使用异构池的深度卷积神经网络,用于在语音混乱和语音混乱之间进行权衡

获取原文

摘要

We develop and present a novel deep convolutional neural network architecture, where heterogeneous pooling is used to provide constrained frequency-shift invariance in the speech spectrogram while minimizing speech-class confusion induced by such invariance. The design of the pooling layer is guided by domain knowledge about how speech classes would change when formant frequencies are modified. The convolution and heterogeneous-pooling layers are followed by a fully connected multi-layer neural network to form a deep architecture interfaced to an HMM for continuous speech recognition. During training, all layers of this entire deep net are regularized using a variant of the “dropout” technique. Experimental evaluation demonstrates the effectiveness of both heterogeneous pooling and dropout regularization. On the TIMIT phonetic recognition task, we have achieved an 18.7% phone error rate, lowest on this standard task reported in the literature with a single system and with no use of information about speaker identity. Preliminary experiments on large vocabulary speech recognition in a voice search task also show error rate reduction using heterogeneous pooling in the deep convolutional neural network.
机译:我们开发并提出了一种新的深度卷积神经网络架构,其中异构池用于在语音谱图中提供受约束的频率换档不变性,同时最小化这种不变性引起的语音级混淆。池池层的设计是由域名知识引导的关于语音类如何在修改中的频率时如何改变。卷积和异构池层之后是完全连接的多层神经网络,以形成接合到HMM的深度架构以进行连续语音识别。在培训期间,使用“辍学”技术的变种来规范整个深网络的所有层。实验评估证明了异质汇集和辍学规范化的有效性。在Timit语音识别任务上,我们已经实现了18.7%的电话错误率,在文献中报告的该标准任务最低,具有单个系统,无需使用有关扬声器标识的信息。语音搜索任务中大词汇语音识别的初步实验还显示了在深卷积神经网络中使用异构池的错误率降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号