首页> 外文会议>International Conference on Electrical and Computer Engineering >A TDPSOLA based concatenation technique for Bengali text to speech synthesis system Subachan
【24h】

A TDPSOLA based concatenation technique for Bengali text to speech synthesis system Subachan

机译:基于TDPSola的孟加拉文本转向语音合成系统Subachan

获取原文

摘要

Creating an intelligible as well as a natural text to speech synthesizer has been the ultimate goal of researchers for the past 30 years; and concatenative synthesis provides the most natural speech. It is usual to have distortions in the concatenation points in concatenative speech synthesis, and therefore generating audible clicks in the synthesized speech. To solve this problem, several signal processing concatenation algorithms exist, such as TDPSOLA, FDPSOLA, MBROLA etc. This paper addresses the problem of audible discontinuities at the concatenation points of diphones in Bengali speech synthesizer Subachan and solving it using TDPSOLA. In the process of doing this, we detected correct pitch mark locations of diphones, detected voiced and unvoiced speech frames of diphones and finally concatenated those diphones using TDPSOLA after rescaling them to remove energy mismatches. As a result, the audible clicks in the concatenation points are removed and speech with much better quality is generated.
机译:创造一个可理解的和自然文本到语音合成器一直是过去30年的研究人员的最终目标;和连接性综合提供了最自然的演讲。通常在连接语音合成中的级联点中扭曲,因此在合成语音中产生可听点击。为了解决这个问题,存在若干信号处理级联算法,例如TDPSola,FDPSola,Mbrola等。本文涉及孟加拉语音合成器Sabachan在孟加拉语音合成器Sabachan中的Diphones级联点的声音不连续性问题,并使用TDPSola解决它。在这样做的过程中,我们检测到偶像的正确俯仰标记位置,检测到偶像的浊音和清晰的语音帧,最后在重新加工后使用TDPSola连接那些双髻块以去除能量不匹配。结果,删除了串联点中的可听点击,并生成具有更好质量的语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号