Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

Peter Birkholz; Lucia Martin; Yi Xu; Stefan Scherbaum; Christiane Neuschaefer-Rube

首页> 外文期刊>Computer speech and language >Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

【24h】

Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

机译：使用发音合成来操纵声带长度，鼻腔和发音精度的韵律特征

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Vocal emotions, as well as different speaking styles and speaker traits, are characterized by a complex interplay of multiple prosodic features. Natural sounding speech synthesis with the ability to control such paralinguistic aspects requires the manipulation of the corresponding prosodic features. With traditional concatenative speech synthesis it is easy to manipulate the "primary" prosodic features pitch, duration, and intensity, but it is very hard to individually control "secondary" prosodic features like phona-tion type, vocal tract length, articulatory precision and nasality. These secondary features can be controlled more directly with parametric synthesis methods. In the present study we analyze the ability of articulatory speech synthesis to control secondary prosodic features by rule. To this end, nine German words were re-synthesized with the software VocalTractLab 2.1 and then manipulated in different ways at the articulatory level to vary vocal tract length, articulatory precision and degree of nasality. Listening tests showed that most of the intended prosodic manipulations could be reliably identified with recognition rates between 77% and 96%. Only the manipulations to increase articulatory precision were hardly recognized. The results suggest that rule-based manipulations in articulatory synthesis are generally sufficient for the convincing synthesis of secondary prosodic features at the word level.

机译：声音的情感，以及不同的说话风格和说话者特征，都具有多种韵律特征的复杂相互作用。具有控制这样的副语言方面的能力的自然听起来语音合成需要操纵相应的韵律特征。使用传统的级联语音合成，可以轻松地控制“主要”韵律特征的音调，持续时间和强度，但是很难单独控制“辅助”韵律特征，例如发声类型，声道长度，发音精度和鼻音。这些次要特征可以使用参数综合方法直接控制。在本研究中，我们分析了发音语音合成通过规则控制次要韵律特征的能力。为此，使用软件VocalTractLab 2.1重新合成了9个德语单词，然后在发音水平上以不同的方式进行操作，以改变声道长度，发音精确度和鼻音度。听力测试表明，大多数预期的韵律操作都可以可靠地识别，识别率在77％至96％之间。几乎没有人认识到只有增加咬合精度的操作。结果表明，在语音合成中基于规则的操作通常足以在单词级别上令人信服地合成辅助韵律特征。

著录项

来源
《Computer speech and language》 |2017年第1期|116-127|共12页
作者
Peter Birkholz; Lucia Martin; Yi Xu; Stefan Scherbaum; Christiane Neuschaefer-Rube;
展开▼
作者单位

Institute of Acoustics and Speech Communication, Technische Universitaet Dresden, 01062 Dresden, Germany;

Department of Phoniatrics, Pedaudiology and Communication Disorders, University Hospital Aachen and RWTH Aachen University, Pauwelsstr. 30, 52074 Aachen, Germany;

Department of Speech, Hearing and Phonetic Sciences, University College London, Chandler House, 2 Wakefield Street, London;

Department of Psychology, Technische Universitaet Dresden, 01062 Dresden, Germany;

Department of Phoniatrics, Pedaudiology and Communication Disorders, University Hospital Aachen and RWTH Aachen University, Pauwelsstr. 30, 52074 Aachen, Germany;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Prosody; Feature manipulation; Articulatory synthesis;

机译：韵律特征操纵;关节合成;

相似文献

外文文献
中文文献
专利

1. Mapping Articulatory-Features to Vocal-Tract Parameters for Voice Conversion [J] . Narpendyah Wisjnu ARIWARDHANI, Masashi KIMURA, Yurie IRIBE, IEICE transactions on information and systems . 2014,第4期

机译：将发音特征映射到人声参数以进行语音转换
2. Genetic learning of vocal tract area functions for articulatory synthesis of Spanish vowels [J] . Jose Brito Applied Soft Computing . 2007,第1a4期

机译：西班牙元音发音合成中的声道区域功能的遗传学习。
3. Estimation of dynamic vocal tract shape for VCV sound using a method of analysis-by-synthesis in articulatory-acoustic transformation [J] . Shozo Goto, Jouji Miwa 電子情報通信学会技術研究報告. 音声. Speech . 2002,第619期

机译：使用发音-声学变换中的合成分析方法估计VCV声音的动态声道形状
4. Acoustic-to-Articulatory Mapping Codebook Constraint for Determining Vocal-tract Length for Inverse Speech Problem and Articulatory Synthesis [C] . Zhen-li Yu, Shang-cui Zeng 16~(th) World Computer Congress 2000 and 2000 5~(th) International Conference on Signal Processing Proceedings Vol.II August 21-25, 2000, Beijing, China . 2000

机译：声学到发音映射码本约束，用于确定语音反演问题和发音合成的声部长度
5. Articulatory gestures and Spanish nasal assimilation. [D] . Honorof, Douglas Nathan. 1999

机译：发音手势和西班牙人的鼻吸收。
6. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion [O] . Prasanta Kumar Ghosh, Shrikanth Narayanan -1

机译：使用从独立于受试者的声学到发音反转的发音特征进行自动语音识别
7. Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis [O] . Peter Birkholz, Lucia Martin, Yi Xu, 2017

机译：用铰接合成操纵声带长度，鼻腔和清晰度精度的韵律特征

Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅