Conference: IEEE International Conference on Acoustics, Speech and Signal Processing

An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer


Abstract

This paper presents a method for estimating voice timbre evaluation values for arbitrary singers' singing voices generated with a singing voice synthesis system, as a step toward a singing voice retrieval system. The voice timbre evaluation values are numerical values corresponding to voice timbre expression words, such as "Age" and "Gender", and they usually have to be assigned manually to each singer's singing voice through listening. To estimate them automatically from a given singer's singing voice, an acoustic feature that captures only each singer's voice timbre is extracted with a Gaussian mixture model trained on parallel data between singing voices sung by many pre-stored target singers and the same voices sung by a reference singer. The voice timbre evaluation values are then estimated from the extracted feature using regression models. The experimental results show that the proposed method can accurately estimate those values for some expression words, such as "Age" and "Gender", and that nonlinear regression is effective for the expression words "Powerfulness" and "Uniqueness."
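
The abstract does not say which regression models were used, so the following is only a minimal sketch of the final step (mapping a per-singer timbre feature to an evaluation value), not the authors' implementation. It assumes synthetic two-dimensional features in place of the GMM-derived feature, and it uses RBF kernel ridge regression as one possible stand-in for "nonlinear regression". All function names, parameters, and data here are hypothetical.

```python
import numpy as np

def fit_linear(X, y, lam=1e-3):
    """Ridge-regularised linear regression with a bias column."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    w = np.linalg.solve(Xb.T @ Xb + lam * np.eye(Xb.shape[1]), Xb.T @ y)
    return lambda Z: np.hstack([Z, np.ones((Z.shape[0], 1))]) @ w

def rbf_kernel(A, B, gamma):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    d2 = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(d2, 0.0))

def fit_kernel_ridge(X, y, gamma=0.5, lam=1e-2):
    """Kernel ridge regression, used here as an assumed nonlinear regressor."""
    alpha = np.linalg.solve(rbf_kernel(X, X, gamma) + lam * np.eye(len(X)), y)
    return lambda Z: rbf_kernel(Z, X, gamma) @ alpha

def rmse(pred, y):
    return float(np.sqrt(np.mean((pred - y) ** 2)))

# Synthetic stand-ins (assumption): one feature vector per singer, plus two
# made-up evaluation scores, one linear and one nonlinear in the features.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y_linear_word = 2.0 * X[:, 0] - X[:, 1]
y_nonlinear_word = np.sin(2.0 * X[:, 0]) + X[:, 1] ** 2

X_train, X_test = X[:150], X[150:]

# A linear model is enough when the score depends linearly on the feature.
linear_on_linear = rmse(
    fit_linear(X_train, y_linear_word[:150])(X_test), y_linear_word[150:]
)

# For the nonlinear score, compare the linear model with the kernel model.
linear_on_nonlinear = rmse(
    fit_linear(X_train, y_nonlinear_word[:150])(X_test), y_nonlinear_word[150:]
)
kernel_on_nonlinear = rmse(
    fit_kernel_ridge(X_train, y_nonlinear_word[:150])(X_test), y_nonlinear_word[150:]
)

print("linear model, linear score:   ", linear_on_linear)
print("linear model, nonlinear score:", linear_on_nonlinear)
print("kernel model, nonlinear score:", kernel_on_nonlinear)
```

On this toy data the kernel model should give a lower held-out error than the linear model on the nonlinear score. This only illustrates why a nonlinear regressor can help for some expression words and says nothing about the paper's actual results.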
