Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

Abstract

Vocal emotions, as well as different speaking styles and speaker traits, are characterized by a complex interplay of multiple prosodic features. Natural sounding speech synthesis with the ability to control such paralinguistic aspects requires the manipulation of the corresponding prosodic features. With traditional concatenative speech synthesis it is easy to manipulate the "primary" prosodic features pitch, duration, and intensity, but it is very hard to individually control "secondary" prosodic features like phonation type, vocal tract length, articulatory precision and nasality. These secondary features can be controlled more directly with parametric synthesis methods. In the present study we analyze the ability of articulatory speech synthesis to control secondary prosodic features by rule. To this end, nine German words were re-synthesized with the software VocalTractLab 2.1 and then manipulated in different ways at the articulatory level to vary vocal tract length, articulatory precision and degree of nasality. Listening tests showed that most of the intended prosodic manipulations could be reliably identified with recognition rates between 77% and 96%. Only the manipulations to increase articulatory precision were hardly recognized. The results suggest that rule-based manipulations in articulatory synthesis are generally sufficient for the convincing synthesis of secondary prosodic features at the word level.

Bibliographic details

  • Source
    Computer Speech and Language | 2017, No. 1 | pp. 116-127 | 12 pages
  • Author affiliations

    Institute of Acoustics and Speech Communication, Technische Universitaet Dresden, 01062 Dresden, Germany;

    Department of Phoniatrics, Pedaudiology and Communication Disorders, University Hospital Aachen and RWTH Aachen University, Pauwelsstr. 30, 52074 Aachen, Germany;

    Department of Speech, Hearing and Phonetic Sciences, University College London, Chandler House, 2 Wakefield Street, London;

    Department of Psychology, Technische Universitaet Dresden, 01062 Dresden, Germany;

    Department of Phoniatrics, Pedaudiology and Communication Disorders, University Hospital Aachen and RWTH Aachen University, Pauwelsstr. 30, 52074 Aachen, Germany;

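  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords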

    Prosody; Feature manipulation; Articulatory synthesis;
