Conference: International Conference on Spoken Language Processing

Unit Fusion for Concatenative Speech Synthesis

Abstract

An important problem in concatenative synthesis is the occurrence of spectral discontinuities, or "concatenation mismatch", between sonorant speech units. In this paper, we present an approach to reduce concatenation mismatch by combining spectral information from two sequences of speech units selected in parallel. Concatenation units, on one hand, define initial spectral trajectories for a target utterance. Fusion units, on the other hand, define the desired transitions between concatenated units. The two unit sequences are "fused" by imposing dynamic constraints defined by the fusion units on the spectral trajectories of the concatenation units. To regenerate the modified speech units, we use a synthesis algorithm based on sinusoidal + all-pole analysis of speech, which overcomes the limitations of residual-excited LPC. Results from a perceptual test show that our method is highly successful at removing concatenation artifacts in speech generated from an inventory of diphones.
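The abstract does not state the exact form of the dynamic constraints, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes a single spectral parameter track per frame (for example, one formant-like frequency), and it assumes the constraint is a least-squares trade-off: stay close to the concatenation units' values while matching the frame-to-frame differences of the fusion unit. The function name `fuse_trajectory` and the weight `lam` are hypothetical.

```python
def fuse_trajectory(concat, fusion, lam=10.0):
    """Least-squares fusion of one spectral-parameter track (illustrative only).

    concat: per-frame values from the concatenation units (may jump at a join).
    fusion: per-frame values from the fusion unit; only its frame-to-frame
            differences (its dynamics) are used.
    lam:    hypothetical weight trading closeness to `concat` against
            matching the fusion unit's dynamics.
    """
    n = len(concat)
    if n != len(fusion):
        raise ValueError("tracks must have the same number of frames")
    if n == 1:
        return [float(concat[0])]

    # Target dynamics: first differences of the fusion track.
    d = [fusion[t + 1] - fusion[t] for t in range(n - 1)]

    # Minimizing sum (x_t - concat_t)^2 + lam * sum ((x_{t+1} - x_t) - d_t)^2
    # gives the tridiagonal system (I + lam * D^T D) x = concat + lam * D^T d,
    # where D is the first-difference operator.
    diag = [1.0 + lam * (1 if t in (0, n - 1) else 2) for t in range(n)]
    off = -lam  # constant sub- and super-diagonal entry
    rhs = []
    for t in range(n):
        prev_d = d[t - 1] if t > 0 else 0.0
        next_d = d[t] if t < n - 1 else 0.0
        rhs.append(concat[t] + lam * (prev_d - next_d))

    # Thomas algorithm: forward elimination, then back substitution.
    for t in range(1, n):
        m = off / diag[t - 1]
        diag[t] -= m * off
        rhs[t] -= m * rhs[t - 1]
    x = [0.0] * n
    x[-1] = rhs[-1] / diag[-1]
    for t in range(n - 2, -1, -1):
        x[t] = (rhs[t] - off * x[t + 1]) / diag[t]
    return x


# Two concatenated units with a 200 Hz jump at the join, and a fusion unit
# whose track is flat across the same frames.
concat_track = [1000.0] * 5 + [1200.0] * 5
fusion_track = [1100.0] * 10
fused_track = fuse_trajectory(concat_track, fusion_track, lam=10.0)
```

In this sketch a larger `lam` makes the fused track follow the fusion unit's dynamics more closely, while a smaller `lam` keeps it nearer the original concatenated values. A real system would operate on full spectral vectors and would still need the resynthesis step the abstract mentions.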
