首页> 外文会议>International Symposium on Chinese Spoken Language Processing >Context-dependent Label Smoothing Regularization for Attention-based End-to-End Code-Switching Speech Recognition
【24h】

Context-dependent Label Smoothing Regularization for Attention-based End-to-End Code-Switching Speech Recognition

机译:基于关注的端到端代码切换语音识别的上下文相关标签平滑正则化

获取原文

摘要

Previous works utilize the context-independent (CI) label smoothing regularization (LSR) method to prevent attention-based End-to-End (E2E) automatic speech recognition (ASR) model, which is trained with a cross entropy loss function and hard labels, from making over-confident predictions. But the CI LSR method does not make use of linguistic knowledge within and between languages in the case of code-switching speech recognition (CSSR). In this paper, we propose the context-dependent (CD) LSR method. According to code-switching linguistic knowledge, the output units are classified into several categories and several context dependency rules are made. Under the guidance of the context dependency rules, prior label distribution is generated dynamically according to the category of historical context, rather than being fixed. Thus, the CD LSR method can utilize the linguistic knowledge in the case of CSSR to further improve the performance of the model. Experiments on the SEAME corpus demonstrate the effects of the proposed method. The final system with the CD LSR method achieves the best performance with 37.21% mixed error rate (MER), obtaining up to 3.7% relative MER reduction compared to the baseline system with no LSR method.
机译:以前的作品利用上下文无关(CI)标签平滑正规化(LSR)方法,以防止注意力为主端至端(E2E)自动语音识别(ASR)模型,用交叉熵损失函数和硬标签受训,作出过分自信的预测。但CI LSR方法不会使内部和码转换的语音识别(CSSR)的情况下,语言之间使用的语言知识。在本文中,我们提出了上下文相关(CD)LSR方法。据码转换语言知识,所述输出单元被分为几个类别和几个上下文依赖关系规则制成。下的上下文依赖关系规则的指导下,根据历史背景的类别动态生成的现有的标签分配,而不是被固定的。因此,CD LSR方法可以利用在CSSR的情况下,语言知识,进一步提高模型的性能。在SEAME语料库实验证明了该方法的效果。与CD LSR方法最终的系统实现了与37.21%混合的误差率(MER)的最佳性能,相比于用没有LSR方法基线系统获得高达相对MER减少3.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号