Structure to Speech Conversion– Speech Generation Based on Infant-like Vocal Imitation –

机译：基于婴儿的声乐模仿的语音转换 - 语音生成的结构 -

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper proposes a new framework of speech generation by imitating "infants' vocal imitation". Most of the speech synthe-sizers take a phoneme sequence as input and generate speech by converting each of the phonemes into a sound sequentially. In other words, they simulate a human process of reading text out. However, infants usually acquire speech generation abil-ity without text or phoneme sequences. Since their phonemic awareness is very immature, they can hardly decompose a word utterance into a sequence of phones. In this situation, as devel-opmental psychology states, infants acquire the holistic sound pattern of words from the utterances of their parents, called word Gestalt, and they reproduce it with their vocal tubes. This behavior is called vocal imitation. In our previous studies, the word Gestalt was defined physically and a method of extract-ing it from an utterance was proposed and used successfully for ASR and CALL. In this paper, a method of converting the word Gestalt back to speech is proposed and evaluated. Unlike a read-ing machine, our proposal simulates infants' vocal imitation.

机译：本文通过模仿“婴儿的声乐仿”提出了一种新的语音一代框架。大多数语音合成仪通过称号序列作为输入，通过顺序将每个音素转换为声音来生成语音。换句话说，他们模拟了阅读文本的人类过程。然而，婴儿通常会在没有文本或音素序列的情况下获得语音一代ybil-ity。由于他们的音素意识非常不成熟，因此它们几乎无法分解为一系列手机。在这种情况下，作为开发的待遇心理状态，婴儿从父母的话语中获取整体声音模式，称为Word Gestalt，他们用他们的声音重现它。此行为称为声乐模仿。在我们以前的研究中，GestAlt在物理上定义了Gestalt，提出了一种从话语中提取它的方法，并成功地用于ASR并呼叫。在本文中，提出了一种将gestalt返回语音转换为语音的方法。与读取机器不同，我们的提案模拟了婴儿的声音模仿。

著录项

来源
《International Speech Communication Association》|2008年||共4页
会议地点
作者
Daisuke Saito; Satoshi Asakawa; Nobuaki Minematsu; Keikichi Hirose;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
speech synthesis; vocal imitation; word Gestalt; invariant structure; Bhattacharyya distance; searching problem;

机译：语音合成;声乐仿;单词gestalt;不变的结构;bhattacharyya距离;搜索问题;

相似文献

外文文献
中文文献
专利

1. Speech Scrambling Based on Imitation of a Target Speech Signal with Non-confidential Content [J] . Dora M. Ballesteros L, Juan M. Moreno A Circuits, systems, and signal processing . 2014,第11期

机译：基于具有非机密内容的目标语音信号的语音扰码
2. Vocal Tract Images Reveal Neural Representations of Sensorimotor Transformation During Speech Imitation [J] . DanielCarey, Marc E.Miquel, Bronwen G.Evans, Cerebral cortex . 2017,第5期

机译：声带图像在语音模仿中揭示了传感器转换的神经表示
3. Vocal imitation of song and speech [J] . Mantell J.T., Pfordresher P.Q. Cognition: International Journal of Cognitive Psychology . 2013,第2期

机译：声乐模仿
4. Structure to Speech Conversion– Speech Generation Based on Infant-like Vocal Imitation – [C] . Daisuke Saito, Satoshi Asakawa, Nobuaki Minematsu, International Speech Communication Association . 2008

机译：基于婴儿的声乐模仿的语音转换 - 语音生成的结构 -
5. A knowledge-based message generation system for motor and speech disabled persons: Design methodology and prototype testing. [D] . Sy, Bon Kiem. 1988

机译：基于知识的运动和语言障碍者消息生成系统：设计方法和原型测试。
6. Individual Differences in Audio-Vocal Speech Imitation Aptitude in Late Bilinguals: Functional Neuro-Imaging and Brain Morphology [O] . Susanne Maria Reiterer, Xiaochen Hu, Michael Erb, 2011

机译：晚期双语者在声乐语音模仿能力上的个体差异：功能性神经影像学和脑形态学
7. Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation [O] . Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, 2011

机译：语音转换中基于特征语音转换和语言相关韵律转换的说话人自适应语音合成
8. Design and Assessment of Computer-Based Speech Training Aids Using Vocal Tract Displays [R] . Bristow, G. J. , Brooks, S. , Fallside, F. , 1977

机译：使用声乐显示器设计和评估基于计算机的语音训练辅助工具

Structure to Speech Conversion– Speech Generation Based on Infant-like Vocal Imitation –

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅