首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture
【24h】

Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture

机译:基于变压器的在线CTC /注意力端到端语音识别架构

获取原文

摘要

Recently, Transformer has gained success in automatic speech recognition (ASR) field. However, it is challenging to deploy a Transformer-based end-to-end (E2E) model for online speech recognition. In this paper, we propose the Transformer-based online CTC/attention E2E ASR architecture, which contains the chunk self-attention encoder (chunk-SAE) and the monotonic truncated attention (MTA) based self-attention decoder (SAD). Firstly, the chunk-SAE splits the speech into isolated chunks. To reduce the computational cost and improve the performance, we propose the state reuse chunk-SAE. Sencondly, the MTA based SAD truncates the speech features monotonically and performs attention on the truncated features. To support the online recognition, we integrate the state reuse chunk-SAE and the MTA based SAD into online CTC/attention architecture. We evaluate the proposed online models on the HKUST Mandarin ASR benchmark and achieve a 23.66% character error rate (CER) with a 320 ms latency. Our online model yields as little as 0.19% absolute CER degradation compared with the offline baseline, and achieves significant improvement over our prior work on Long Short-Term Memory (LSTM) based online E2E models.
机译:最近,Transformer在自动语音识别(ASR)领域获得了成功。但是,部署基于变压器的端到端(E2E)模型进行在线语音识别具有挑战性。在本文中,我们提出了一种基于变压器的在线CTC /注意力E2E ASR体系结构,该体系结构包含块自我注意编码器(chunk-SAE)和基于单调截断注意(MTA)的自我注意解码器(SAD)。首先,块SAE将语音拆分为孤立的块。为了降低计算成本并提高性能,我们提出了状态重用块-SAE。其次,基于MTA的SAD会单调截断语音特征,并注意截断的特征。为了支持在线识别,我们将状态重用块SAE和基于MTA的SAD集成到在线CTC /注意架构中。我们以HKUST普通话ASR基准评估了建议的在线模型,并以320 ms的延迟实现了23.66%的字符错误率(CER)。与离线基准相比,我们的在线模型产生的CER绝对降低仅为0.19%,并且与我们之前基于长期短期记忆(LSTM)的在线E2E模型的工作相比,取得了显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号