...
首页> 外文期刊>International journal of speech technology >A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation
【24h】

A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation

机译:HMM-WDLT框架,用于基于HNM的语音转换,并在共振峰带宽,持续时间和激励方面进行参数调整

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a framework, named Hidden Markov Model-Weighted Deviation Linear Transformation (HMM-WDLT), for performing voice conversion based on the Harmonic + Noise Model (HNM). The HMM-WDLT achieves the lowest average spectral distortion in a comparative study of spectral conversion. The problem with broader formant bandwidths can be remedied by a weighting constraint and ordering check with the minimum clearance estimated from the HMM-WDLT. By jointly exploiting the dynamic time warping (DTW) and the HMM-WDLT, the conversion in duration is also feasible. Moreover, the HMM-WDLT plays a part in the conversion of excitation-related parameters such as the fundamental frequency, maximum voiced frequency, and harmonic magnitudes for critical bands below 2.7 kHz. The ability of modifying the pitch and duration concurrently allows the HMM-WDLT to carry out the prosody conversion. Listening tests reveal that the converted speech successfully catches the speaker's individuality with satisfactory quality.
机译:本文提出了一个名为隐马尔可夫模型加权偏差线性变换(HMM-WDLT)的框架,用于基于谐波+噪声模型(HNM)进行语音转换。在对光谱转换的比较研究中,HMM-WDLT实现了最低的平均光谱失真。共振峰带宽更宽的问题可以通过加权约束和具有HMM-WDLT估计的最小间隙的排序检查来解决。通过联合利用动态时间规整(DTW)和HMM-WDLT,持续时间的转换也是可行的。此外,对于低于2.7 kHz的关键频带,HMM-WDLT在激励相关参数(例如基频,最大浊音频率和谐波幅度)的转换中也发挥了作用。同时修改音高和持续时间的能力允许HMM-WDLT进行韵律转换。听力测试表明,转换后的语音以令人满意的质量成功抓住了说话者的个性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号