
Voice conversion application (VOCAL)


Abstract

Recently, a great deal of work has been done in speech technology. Text-to-speech and automatic speech recognition have been the priorities of research efforts to improve human-machine interaction, and improving the naturalness of that interaction is becoming an important concern. Voice conversion can serve as a useful tool that provides new insights into the personification of speech-enabled systems. In this research, two main parameters are considered: vocal tract structure and pitch. For the conversion process, speech is decomposed into two components, an excitation component and a filter component, using Linear Predictive Coding (LPC). Pitch is determined by autocorrelation. After the acoustic components are obtained from the source speaker and the target speaker, they are mapped one-to-one so that the acoustic features of the source speaker are replaced with those of the target speaker. Finally, the signal is modified by resynthesis so that the resulting speech is perceived as if spoken by the target speaker.
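The abstract gives no implementation details, so the following is only a minimal sketch of the steps it names: autocorrelation pitch estimation, LPC analysis into an excitation and an all-pole filter, and resynthesis. All function names, the pitch search range (60 to 400 Hz), and the frame-level mapping in `convert_frame` (source excitation driven through the target filter) are illustrative assumptions, not the authors' method. Pitch modification and frame alignment between speakers are omitted.

```python
import numpy as np

def estimate_pitch(frame, fs, fmin=60.0, fmax=400.0):
    """Estimate pitch in Hz of a voiced frame from its autocorrelation peak.

    fmin and fmax are assumed search bounds, not values from the paper.
    """
    frame = frame - np.mean(frame)
    # Autocorrelation at non-negative lags 0..N-1.
    r = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lag_min = int(fs / fmax)
    lag_max = min(int(fs / fmin), len(r) - 1)
    lag = lag_min + np.argmax(r[lag_min:lag_max + 1])
    return fs / lag

def lpc(frame, order):
    """Return LPC coefficients [1, a1, ..., ap] (autocorrelation method)."""
    n = len(frame)
    r = np.correlate(frame, frame, mode="full")[n - 1:n + order]
    # Normal equations: sum_k a_k r[|i-k|] = -r[i], i = 1..order.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, -r[1:order + 1])
    return np.concatenate(([1.0], a))

def residual(frame, a):
    """Excitation component: the frame passed through the inverse filter A(z)."""
    return np.convolve(frame, a)[:len(frame)]

def synthesize(excitation, a):
    """Resynthesis: the excitation passed through the all-pole filter 1/A(z)."""
    out = np.zeros(len(excitation))
    for n in range(len(excitation)):
        acc = excitation[n]
        for k in range(1, min(len(a), n + 1)):
            acc -= a[k] * out[n - k]
        out[n] = acc
    return out

def convert_frame(source_frame, target_frame, order=12):
    """Hypothetical mapping: source excitation through the target's filter."""
    a_src = lpc(source_frame, order)
    a_tgt = lpc(target_frame, order)
    return synthesize(residual(source_frame, a_src), a_tgt)
```

In this sketch, `synthesize(residual(x, a), a)` reproduces `x` up to rounding error, which is what allows the filter to be swapped while the excitation is kept. A practical system would also window and overlap-add the frames.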
