Published in: 電子情報通信学会技術研究報告. 音声. Speech

Rule-Based Speech Morphing for Verification of Emotional Perception Model

Abstract

This paper reports rules for morphing speech so that it is perceived as having different primitive features, for example so that it sounds more "bright" or more "dark". In previous work, we proposed a three-layered model for the perception of emotional speech, consisting of five categories of emotional speech, primitive features, and acoustic features. The model assumes that humans perceive emotion in speech through a combination of the primitive features that listeners attribute to the utterance they hear. Based on experiments and acoustic analysis, we built the relationships between the three layers with a top-down method and reported that these relationships are significant for the perception of emotional speech. To verify the relationships, we adopt a bottom-up method: speech is morphed (resynthesized) by combining acoustic features in the bottommost layer so that it is perceived as having one or more primitive features, which can in turn be perceived as different categories of emotion. To this end, rules for morphing speech so that it is perceived as having different primitive features must be established first. This paper describes how the rules were established and presents the principles and strategy of rule creation. An experiment was conducted to evaluate the performance of the rules. The results indicate that the proposed principles and strategy provide a direction for creating rules for primitive features. The results also confirm that the previously built relationships between primitive features and acoustic features are relevant and effective.
