【24h】

Machine Speech Chain with Deep Learning

机译:深度学习机器语音链

获取原文

摘要

In this paper, we develop a closed-loop speechchain model based on deep learning and construct asequence-to-sequence model for both ASR and TTStasks as well as a loop connection between thesetwo processes. The sequence-to-sequence model inclosed-loop architecture allows us to train our modelon the concatenation of both labeled and unlabeleddata. While ASR transcribes the unlabeled speechfeatures, TTS attempts to reconstruct the originalspeech waveform based on text from ASR. In theopposite direction, ASR also reconstructs the origi-nal text transcription given the synthesized speech.To the best of our knowledge, this is the rst deeplearning model that integrates human speech per-ception and production behaviors.
机译:在本文中,我们基于深度学习开发了一个闭环语音链模型,并为ASR和TTS任务以及这两个过程之间的循环连接构建了按序排序模型。序列到序列模型的闭环体系结构使我们能够在标记数据和未标记数据的串联上训练我们的模型。当ASR转录未标记的语音功能时,TTS会尝试根据ASR的文本来重建原始语音波形。在相反的方向上,ASR还会在合成语音的情况下重构原始文本转录。据我们所知,这是第一个将人类语音感知和生产行为整合在一起的深度学习模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号