STRUCTURAL KLD FOR CROSS-VARIETY SPEAKER ADAPTATION IN HMM-BASED SPEECH SYNTHESIS

机译：基于HMM的语音合成中跨物种扬声器自适应的结构KLD

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

While the synthesis of natural sounding, neutral stylernspeech can be achieved using today’s technology, fast adaptationrnof speech synthesis to different contexts and situationsrnstill poses a challenge. In the context of varietyrnmodeling (dialects, sociolects) we have to cope with thernproblem that no standardized orthographic form is availablernand that existing speech resources for these varietiesrnare rare. We present recent approaches in the fieldrnof cross-lingual speaker transformation for HMM-basedrnspeech synthesis and propose a method for transforming anrnarbitrary speaker’s voice from one variety to another one.rnWe apply Kullback-Leibler divergence for data mapping ofrnHMM-states, transfer probability density functions to therndecision tree of the other variety and perform speaker adaptation.rnA method to integrate structural information in thernmapping is also presented and analyzed. Subjective listeningrntests show that the proposed method produces speechrnof significantly higher quality than standard speaker adaptationrntechniques.

机译：虽然可以使用当今的技术来实现自然发声，中性风格的语音合成，但快速的语音合成以适应不同的环境和情况仍然是一个挑战。在变体建模（方言，社会学）的背景下，我们必须应对以下问题：没有可用的标准化拼字形式，并且这些变体的现有语音资源很少。我们介绍了基于HMM的语音合成在fieldrnof跨语言说话人转换中的最新方法，并提出了一种将任意说话人的声音从一个变体转换为另一个变体的方法。还提出并分析了一种将结构信息整合到热成像中的方法。主观听觉测试表明，所提出的方法所产生的语音质量比标准的说话人适应技术要高得多。

著录项

来源
《Proceedings of the 10th IASTED international conference on Signal Processing, Pattern Recognition, and Applications》|2013年|382-387|共6页
会议地点 Innsbruck(AU)
作者
Markus E. Toman; Michael Pucher;
展开▼
作者单位

Telecommunications Research Center Vienna (FTW) Vienna, Austria email: toman@ftw.at;

Telecommunications Research Center Vienna (FTW) Vienna, Austria email: pucher@ftw.at;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
speech processing; algorithms and techniques; speech synthesis; speaker adaptation; variety modeling;

机译：语音处理;算法和技术;语音合成;扬声器自适应;多样性建模;

相似文献

外文文献
中文文献
专利

1. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis [J] . John Dines, Hui Liang, Lakshmi Saheer, Computer speech and language . 2013,第2期

机译：个性化语音到语音翻译：基于HMM的语音合成的无监督跨语言说话者自适应
2. Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm [J] . Yamagishi J., Kobayashi T., Nakano Y., IEEE transactions on audio, speech and language processing . 2009,第1期

机译：基于HMM的语音合成的说话人自适应算法和约束SMAPLR自适应算法的分析
3. Frequency Warping for Speaker Adaptation in HMM-based Speech Synthesis [J] . Weixun Gao, Qiying Cao Journal of information science and engineering . 2014,第4期

机译：基于HMM的语音合成中的说话人自适应频率弯曲
4. STRUCTURAL KLD FOR CROSS-VARIETY SPEAKER ADAPTATION IN HMM-BASED SPEECH SYNTHESIS [C] . Markus E. Toman, Michael Pucher IASTED International Conference on Signal Processing, Pattern Recognition and Applications . 2013

机译：基于HMM的语音合成中的跨种类扬声器适应的结构KLD
5. HMM-based non-intrusive speech quality and implementation of Viterbi score distribution and hiddenness based measures to improve the performance of speech recognition [D] . Talwar, Gaurav 2006

机译：基于HMM的非侵入式语音质量以及基于Viterbi分数分布和隐蔽性的措施的实施，以提高语音识别的性能
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. Structural KLD for cross-variety speaker adaptation in HMM-based speech synthesis [O] . Markus E. Toman, Michael Pucher 2013

机译：用于基于Hmm的语音合成中的跨多种说话者适应的结构KLD

STRUCTURAL KLD FOR CROSS-VARIETY SPEAKER ADAPTATION IN HMM-BASED SPEECH SYNTHESIS

摘要

著录项

相似文献

相关主题

期刊订阅