首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding
【24h】

Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

机译:词嵌入正则化和融合解码的序列到序列自动语音识别

获取原文

摘要

In this paper, we investigate the benefit that off-the-shelf word embedding can bring to the sequence-to-sequence (seq-to-seq) automatic speech recognition (ASR). We first introduced the word embedding regularization by maximizing the cosine similarity between a transformed decoder feature and the target word embedding. Based on the regularized decoder, we further proposed the fused decoding mechanism. This allows the decoder to consider the semantic consistency during decoding by absorbing the information carried by the transformed decoder feature, which is learned to be close to the target word embedding. Initial results on LibriSpeech demonstrated that pre-trained word embedding can signifi-cantly lower ASR recognition error with a negligible cost, and the choice of word embedding algorithms among Skip-gram, CBOW and BERT is important.
机译:在本文中,我们研究了现成的词嵌入可以为序列到序列(seq-to-seq)自动语音识别(ASR)带来的好处。我们首先通过最大化变换后的解码器特征与目标词嵌入之间的余弦相似性来介绍词嵌入正则化。基于正则化解码器,我们进一步提出了融合解码机制。这允许解码器通过吸收变换后的解码器特征所携带的信息来考虑解码期间的语义一致性,该信息被学习为接近目标词嵌入。在LibriSpeech上的初步结果表明,经过预训练的词嵌入可以以可忽略的成本显着降低ASR识别错误,并且在Skip-gram,CBOW和BERT之间选择词嵌入算法非常重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号