首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Pitch adaptive training for hmm-based singing voice synthesis
【24h】

Pitch adaptive training for hmm-based singing voice synthesis

机译:基于HMM的歌唱语音合成的音高自适应培训

获取原文

摘要

A statistical parametric approach to singing voice synthesis based on hidden Markov Models (HMMs) has been growing in popularity over the last few years. The spectrum, excitation, vibrato, and duration of singing voices in this approach are simultaneously modeled with context-dependent HMMs and waveforms are generated from the HMMs themselves. HMM-based singing voice synthesis systems are heavily based on the training data in performance because these systems are "corpus-based." Therefore, HMMs corresponding to contextual factors that hardly ever appear in the training data cannot be well-trained. Pitch should especially be correctly covered since generated F0 trajectories have a great impact on the subjective quality of synthesized singing voices. We applied the method of "speaker adaptive training" (SAT) to "pitch adaptive training," which is discussed in this paper. This technique made it possible to normalize pitch based on musical notes in the training process. The experimental results demonstrated that the proposed technique could alleviate the data sparseness problem.
机译:基于隐马尔可夫模型(HMMS)唱歌语音合成的统计参数方法在过去几年中越来越受欢迎。这种方法中的谱,激发,颤音和唱歌的歌声和持续时间同时用上下文相关的HMMS建模,并且从HMMS本身产生波形。基于HMM的歌唱语音合成系统严重基于性能的培训数据,因为这些系统是“基于语料库”。因此,对应于训练数据几乎没有出现的上下文因素的HMM不能训练有素。尤其应该被正确地覆盖音高,因为生成的F0轨迹对合成唱歌声音的主观质量产生了很大影响。我们将“扬声器自适应培训”(SAT)应用于本文讨论的“音频自适应培训”(SAT)至“俯仰自适应培训”。该技术使得基于训练过程中的音符来标准化间距。实验结果表明,所提出的技术可以减轻数据稀疏问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号