首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

【24h】

Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

机译：词嵌入正则化和融合解码的序列到序列自动语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate the benefit that off-the-shelf word embedding can bring to the sequence-to-sequence (seq-to-seq) automatic speech recognition (ASR). We first introduced the word embedding regularization by maximizing the cosine similarity between a transformed decoder feature and the target word embedding. Based on the regularized decoder, we further proposed the fused decoding mechanism. This allows the decoder to consider the semantic consistency during decoding by absorbing the information carried by the transformed decoder feature, which is learned to be close to the target word embedding. Initial results on LibriSpeech demonstrated that pre-trained word embedding can signifi-cantly lower ASR recognition error with a negligible cost, and the choice of word embedding algorithms among Skip-gram, CBOW and BERT is important.

机译：在本文中，我们研究了现成的词嵌入可以为序列到序列（seq-to-seq）自动语音识别（ASR）带来的好处。我们首先通过最大化变换后的解码器特征与目标词嵌入之间的余弦相似性来介绍词嵌入正则化。基于正则化解码器，我们进一步提出了融合解码机制。这允许解码器通过吸收变换后的解码器特征所携带的信息来考虑解码期间的语义一致性，该信息被学习为接近目标词嵌入。在LibriSpeech上的初步结果表明，经过预训练的词嵌入可以以可忽略的成本显着降低ASR识别错误，并且在Skip-gram，CBOW和BERT之间选择词嵌入算法非常重要。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing 》|2020年|7879-7883|共5页
会议地点
作者
Alexander H. Liu; Tzu-Wei Sung; Shun-Po Chuang; Hung-yi Lee; Lin-shan Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
automatic speech recognition; sequence-to-sequence; word embedding; regularization; decoding;

机译：自动语音识别;序列到序列;词嵌入;正则化;解码;

相似文献

外文文献
中文文献
专利

1. Phonetic Words Decoding Software in the Problem of Russian Speech Recognition [J] . A. V. Savchenko Automation and Remote Control . 2013 ,第7期

机译：俄语语音识别中的语音单词解码软件
2. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition [J] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE transactions on information and systems . 2019 ,第12期

机译：潜在词递归神经网络语言模型用于自动语音识别
3. Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition [J] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE transactions on information and systems . 2018 ,第6期

机译：基于潜在词语言模型混合的领域自适应语音自动识别
4. Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding [C] . Alexander H. Liu, Tzu-Wei Sung, Shun-Po Chuang, IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：与Word嵌入正则化和融合解码的顺序序列自动语音识别
5. Automatic word to morpheme decomposer for automatic speech recognition of Russian. [D] . Urmatbek, Jakshylyk. 2015

机译：自动词到词素分解器，用于俄语的自动语音识别。
6. Development of a Two-Stage Procedure for the Automatic Recognition of Dysfluencies in the Speech of Children Who Stutter: II. ANN Recognition of Repetitions and Prolongations With Supplied Word Segment Markers [O] . Peter Howell, Stevie Sackin, Kazan Glenn -1

机译：自动识别口吃儿童言语中流离失所的两阶段程序的发展：II。具有提供的词段标记的ANN识别重复和延长
7. Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding [O] . Alexander H. Liu, Tzu-Wei Sung, Shun-Po Chuang, 2020

机译：与Word嵌入正则化和融合解码的顺序序列自动语音识别

Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

摘要

著录项

相似文献

相关主题

期刊订阅