Analysis of sad speech and neutral-sad speech conversion


Abstract

This paper proposes a parameter mapping model for converting neutral speech to sad speech, derived by comparing statistical parameters of neutral and sad speech sample pairs with the same text content. The comparison shows that sad speech generally has a lower fundamental frequency (F0) than neutral speech and a more stable F0 contour, while its formants are slightly higher. In terms of rhythm, sad speech is slightly slower than neutral speech. Voiced and voiceless segments differ significantly in this respect: voiceless segments are significantly longer in sad speech. Neutral-to-sad speech conversion was implemented using this model, with good results.
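The abstract gives only the direction of each change, not its size, so the following is a minimal sketch of what such a parameter mapping could look like, not the authors' model. The function names `map_f0` and `map_durations` and every scale factor are hypothetical placeholders chosen for illustration. Formant shifting is left out because it requires a full analysis and synthesis chain.

```python
import numpy as np


def map_f0(f0_hz, mean_scale=0.9, range_scale=0.7):
    """Lower the mean F0 and flatten the contour on voiced frames.

    Frames with F0 == 0 are treated as unvoiced and left unchanged.
    mean_scale and range_scale are illustrative values, not taken
    from the paper.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    out = f0.copy()
    voiced = f0 > 0
    if not voiced.any():
        return out
    mean = f0[voiced].mean()
    # Shift the mean down, then shrink deviations around it.
    out[voiced] = mean * mean_scale + (f0[voiced] - mean) * range_scale
    return out


def map_durations(segments, voiced_scale=1.05, voiceless_scale=1.3):
    """Stretch segment durations, voiceless segments more than voiced ones.

    segments is a list of (is_voiced, duration_seconds) pairs.
    Both scale factors are illustrative assumptions.
    """
    return [
        (is_voiced, dur * (voiced_scale if is_voiced else voiceless_scale))
        for is_voiced, dur in segments
    ]


if __name__ == "__main__":
    neutral_f0 = [0.0, 200.0, 220.0, 180.0, 0.0]
    print(map_f0(neutral_f0))
    print(map_durations([(True, 0.2), (False, 0.1)]))
```

In a real system the factors would be estimated from the paired neutral and sad recordings, and the modified F0 and durations would be passed to a vocoder or a pitch and time scaling method for resynthesis.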
