首页> 外文会议>2018 First Asian Conference on Affective Computing and Intelligent Interaction >Controlling Expressivity using Input Codes in Neural Network based TTS
【24h】

Controlling Expressivity using Input Codes in Neural Network based TTS

机译:在基于神经网络的TTS中使用输入代码控制表达

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a study on the use of input codes in the neural network acoustic modeling for expressive TTS. Specifically, we use different kinds of input codes, augmented with the linguistic features, as the input of a BLSTM-based acoustic model, to control the expressivity of the synthesized speech. The input codes, in one-hot representation, include dialogue code, sentiment code and sentence position code. The dialogue code indicates whether the text is a dialogue or narration in an audiobook story. The sentiment code is obtained from a sentiment analysis tool, which labels each sentence as positive, negative and neutral. The sentence position code indicates the position of the sentence in the paragraph. We believe these codes are highly related to the expressiveness of the audiobook speech. Experiments on the data from the Blizzard Challenge 2017 demonstrate the effectiveness of the use of input codes in the neural network approach for expressive TTS.
机译:本文介绍了在表达性TTS的神经网络声学建模中使用输入代码的研究。具体来说,我们使用不同种类的输入代码(加上语言功能)作为基于BLSTM的声学模型的输入,以控制合成语音的表达能力。输入代码以一键式表示,包括对话代码,情感代码和句子位置代码。对话代码指示文本是有声读物故事中的对话还是旁白。情感代码是从情感分析工具获得的,该工具将每个句子分别标记为肯定,否定和中立。句子位置代码指示句子在段落中的位置。我们认为这些代码与有声书语音的表达能力高度相关。来自Blizzard Challenge 2017的数据实验表明,在表达性TTS的神经网络方法中使用输入代码的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号