Machine Speech Chain with Deep Learning

机译：深度学习机器语音链

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we develop a closed-loop speechchain model based on deep learning and construct asequence-to-sequence model for both ASR and TTStasks as well as a loop connection between thesetwo processes. The sequence-to-sequence model inclosed-loop architecture allows us to train our modelon the concatenation of both labeled and unlabeleddata. While ASR transcribes the unlabeled speechfeatures, TTS attempts to reconstruct the originalspeech waveform based on text from ASR. In theopposite direction, ASR also reconstructs the origi-nal text transcription given the synthesized speech.To the best of our knowledge, this is the rst deeplearning model that integrates human speech per-ception and production behaviors.

机译：在本文中，我们基于深度学习开发了一个闭环语音链模型，并为ASR和TTS任务以及这两个过程之间的循环连接构建了按序排序模型。序列到序列模型的闭环体系结构使我们能够在标记数据和未标记数据的串联上训练我们的模型。当ASR转录未标记的语音功能时，TTS会尝试根据ASR的文本来重建原始语音波形。在相反的方向上，ASR还会在合成语音的情况下重构原始文本转录。据我们所知，这是第一个将人类语音感知和生产行为整合在一起的深度学习模型。

著录项

来源
《日本音響学会;日本音響学会秋季研究発表会》|2018年|939-940|共2页
会议地点 1340-3168
作者
Andros Tjandra; Sakriani Sakti; Satoshi Nakamura;
展开▼
作者单位

NAIST RIKEN AIP;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain [J] . Sahoko NAKAYAMA, Andros TJANDRA, Sakriani SAKTI, IEICE transactions on information and systems . 2021,第10期

机译：代码切换ASR和使用Machine语音链使用半培训学习的TTS
2. Deep Learning Based Part-of-Speech Tagging for Malayalam Twitter Data (Special Issue: Deep Learning Techniques for Natural Language Processing) [J] . S.Kumar, M. AnandKumar, K.P.Soman Journal of Intelligent Systems . 2019,第3期

机译：基于深入学习的Malayalam Twitter数据的语音标记（特殊问题：自然语言处理的深度学习技巧）
3. Python Machine Learning: Machine Learning and Deep Learning With Python, Scikit-Learn, and TensorFlow 2, Third Edition [J] . Yuan Ren International journal of knowledge-based organizations . 2021,第1期

机译：Python机器学习：使用Python，Scikit-Learn和Tensorflow 2，第三版使用Python
4. Machine Speech Chain with Deep Learning [C] . Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 日本音響学会研究発表会 . 2018

机译：机器语音链深入学习
5. Applications of Machine Learning in Supply Chains [D] . Oroojlooy, Afshin. 2019

机译：机器学习在供应链中的应用
6. Evaluation of Hyperparameter Optimization in Machine and Deep Learning Methods for Decoding Imagined Speech EEG [O] . Ciaran Cooney, Attila Korik, Raffaella Folli, 2020

机译：解码中的机器和深层学习方法中的高参数优化评估
7. Part of Speech Tagging in Urdu: Comparison of Machine and Deep Learning Approaches [O] . Wahab Khan, Ali Daud, Khairullah Khan, 2019

机译：乌尔都语中的词性标记部分：机器和深度学习方法的比较

Machine Speech Chain with Deep Learning

摘要

著录项

相似文献

相关主题

期刊订阅