A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation

Hwai-Tsu Hu; Chu Yu

首页> 外文期刊>International journal of speech technology >A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation

【24h】

A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation

机译：HMM-WDLT框架，用于基于HNM的语音转换，并在共振峰带宽，持续时间和激励方面进行参数调整

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a framework, named Hidden Markov Model-Weighted Deviation Linear Transformation (HMM-WDLT), for performing voice conversion based on the Harmonic + Noise Model (HNM). The HMM-WDLT achieves the lowest average spectral distortion in a comparative study of spectral conversion. The problem with broader formant bandwidths can be remedied by a weighting constraint and ordering check with the minimum clearance estimated from the HMM-WDLT. By jointly exploiting the dynamic time warping (DTW) and the HMM-WDLT, the conversion in duration is also feasible. Moreover, the HMM-WDLT plays a part in the conversion of excitation-related parameters such as the fundamental frequency, maximum voiced frequency, and harmonic magnitudes for critical bands below 2.7 kHz. The ability of modifying the pitch and duration concurrently allows the HMM-WDLT to carry out the prosody conversion. Listening tests reveal that the converted speech successfully catches the speaker's individuality with satisfactory quality.

机译：本文提出了一个名为隐马尔可夫模型加权偏差线性变换（HMM-WDLT）的框架，用于基于谐波+噪声模型（HNM）进行语音转换。在对光谱转换的比较研究中，HMM-WDLT实现了最低的平均光谱失真。共振峰带宽更宽的问题可以通过加权约束和具有HMM-WDLT估计的最小间隙的排序检查来解决。通过联合利用动态时间规整（DTW）和HMM-WDLT，持续时间的转换也是可行的。此外，对于低于2.7 kHz的关键频带，HMM-WDLT在激励相关参数（例如基频，最大浊音频率和谐波幅度）的转换中也发挥了作用。同时修改音高和持续时间的能力允许HMM-WDLT进行韵律转换。听力测试表明，转换后的语音以令人满意的质量成功抓住了说话者的个性。

著录项

来源
《International journal of speech technology》 |2012年第2期|p.215-225|共11页
作者
Hwai-Tsu Hu; Chu Yu;
展开▼
作者单位

Department of Electronic Engineering, National 1-Lan University,I-Lan, Taiwan, ROC;

Department of Electronic Engineering, National 1-Lan University,I-Lan, Taiwan, ROC;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
voice conversion; harmonic + noise model; hidden markov model; weighted deviation linear transform;

机译：语音转换;谐波+噪声模型;隐藏的马尔可夫模型;加权偏差线性变换;

相似文献

外文文献
中文文献
专利

1. Parametric Formant Modelling and Transformation in Voice Conversion [J] . DIMITRIOS RENTZOS, SAEED VASEGHI, QIN YAN, International journal of speech technology . 2005,第3期

机译：语音转换中的参数共振峰建模和转换
2. Effects of vocalic duration and first formant offset on final voicing judgments by children and adults [J] . Jones C The Journal of the Acoustical Society of America . 2005,第6期

机译：语音持续时间和第一共振峰偏移对儿童和成人最终发音判断的影响
3. A Modified Adaptive Algorithm for Formant Bandwidth in Whisper Conversion [J] . Gang Lv, Heming Zhao Computer and Information Science . 2008,第4期

机译：耳语转换中共振峰带宽的改进自适应算法
4. Parametric Speech Coding Framework for Voice Conversion Based on Mixed Excitation Model [C] . Michal Lenarczyk International conference on text, speech and dialogue . 2014

机译：基于混合激励模型的语音转换参数语音编码框架
5. Modeling Viral Suppression Viral Rebound and State-Specific Duration of HIV Patients with CD4 Count Adjustment: Parametric Multistate Frailty Model Approach [O] . Zelalem G. Dessie, Temesgen Zewotir, Henry Mwambi, 2020

机译：通过CD4计数调整对HIV患者的病毒抑制病毒反弹和特定状态持续时间进行建模：参数多状态脆弱模型方法
6. A Modified Adaptive Algorithm for Formant Bandwidth in Whisper Conversion [O] . Heming Zhao, Gang Lv 2009

机译：耳语转换中共振峰带宽的改进自适应算法

A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation

摘要

著录项

相似文献

相关主题

期刊订阅