首页> 外文会议>International Speech Communication Association >Structure to Speech Conversion– Speech Generation Based on Infant-like Vocal Imitation –
【24h】

Structure to Speech Conversion– Speech Generation Based on Infant-like Vocal Imitation –

机译:基于婴儿的声乐模仿的语音转换 - 语音生成的结构 -

获取原文
获取外文期刊封面目录资料

摘要

This paper proposes a new framework of speech generation by imitating "infants' vocal imitation". Most of the speech synthe-sizers take a phoneme sequence as input and generate speech by converting each of the phonemes into a sound sequentially. In other words, they simulate a human process of reading text out. However, infants usually acquire speech generation abil-ity without text or phoneme sequences. Since their phonemic awareness is very immature, they can hardly decompose a word utterance into a sequence of phones. In this situation, as devel-opmental psychology states, infants acquire the holistic sound pattern of words from the utterances of their parents, called word Gestalt, and they reproduce it with their vocal tubes. This behavior is called vocal imitation. In our previous studies, the word Gestalt was defined physically and a method of extract-ing it from an utterance was proposed and used successfully for ASR and CALL. In this paper, a method of converting the word Gestalt back to speech is proposed and evaluated. Unlike a read-ing machine, our proposal simulates infants' vocal imitation.
机译:本文通过模仿“婴儿的声乐仿”提出了一种新的语音一代框架。大多数语音合成仪通过称号序列作为输入,通过顺序将每个音素转换为声音来生成语音。换句话说,他们模拟了阅读文本的人类过程。然而,婴儿通常会在没有文本或音素序列的情况下获得语音一代ybil-ity。由于他们的音素意识非常不成熟,因此它们几乎无法分解为一系列手机。在这种情况下,作为开发的待遇心理状态,婴儿从父母的话语中获取整体声音模式,称为Word Gestalt,他们用他们的声音重现它。此行为称为声乐模仿。在我们以前的研究中,GestAlt在物理上定义了Gestalt,提出了一种从话语中提取它的方法,并成功地用于ASR并呼叫。在本文中,提出了一种将gestalt返回语音转换为语音的方法。与读取机器不同,我们的提案模拟了婴儿的声音模仿。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号