首页> 外文会议>International Symposium on Chinese Spoken Language Processing >A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition
【24h】

A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition

机译:基于说话者的深度学习方法,用于多说话者自动语音识别的联合语音分离和声学建模

获取原文

摘要

We propose a novel speaker-dependent (SD) approach to joint training of deep neural networks (DNNs) with an explicit speech separation structure for multi-talker speech recognition in a single-channel setting. First, a multi-condition training strategy is designed for a SD-DNN recognizer in multi-talker scenarios, which can significantly reduce the decoding runtime and improve the recognition accuracy over the approaches that use speaker-independent DNN models with a complicated joint decoding framework. In addition, a SD regression DNN for mapping the acoustic features of mixed speech to the speech features of a target speaker is jointly trained with the SD recognition DNN for acoustic modeling. Our experiments on the Speech Separation Challenge (SSC) task show that the proposed SD recognition system under multi-condition training achieves an average word error rate (WER) of 3.8%, yielding a relative WER reduction of 65.1% from the proposed DNN preprocessing approach under clean-condition training [1]. Furthermore, the jointly trained DNN system generates a relative WER reduction of 13.2% from the state-of-the-art systems under multi-condition training.
机译:我们提出了一种新颖的基于说话者的(SD)方法,用于深度神经网络(DNN)的联合训练,具有显式的语音分离结构,可在单通道设置中进行多方对话者语音识别。首先,针对多说话者场景中的SD-DNN识别器设计了一种多条件训练策略,与使用独立于说话者的DNN模型和复杂的联合解码框架的方法相比,该方法可以显着减少解码时间并提高识别精度。另外,将用于将混合语音的声学特征映射到目标说话者的语音特征的SD回归DNN与用于声学建模的SD识别DNN一起进行训练。我们对语音分离挑战(SSC)任务的实验表明,所提出的SD识别系统在多条件训练下的平均单词错误率(WER)为3.8%,与所提出的DNN预处理方法相比,相对WER降低了65.1%在清洁条件下接受培训[1]。此外,经过联合训练的DNN系统在多条件训练下与最新系统相比,相对WER降低了13.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号