【24h】

Voice conversion application (VOCAL)

机译:语音转换应用程序(声音)

获取原文

摘要

Recently, a lot of works has been done in speech technology. Text-to-Speech and Automatic Speech Recognition have been the priorities in research efforts to improve the human-machine interaction. The ways to improve naturalness in human-machine interaction is becoming an inportant matter of concern. Voice conversion can be served as a useful tools to provide new insights related to personification of speech enabled systems. In this research, there are two main parameters are considered vocal tract structure and pitch. For conversion process speech is resolved in two components, excitation component and filtered component using Linear Predictive Coding (LPC). Ptich is determined by autocorrelation. After obtained the acoustic components from source speaker and target speaker, then the acoustic components will be mapped one-to-one to replaced the the acoustic feature from source speaker to target speaker. At least, signal is modified by resynthesis so the resulted speech would perceive as if spoken by target speaker.
机译:最近,在语音技术中完成了很多作品。文本到语音和自动语音识别是研究努力改善人机交互的优先事项。提高人机相互作用自然度的方法正在成为一个令人担忧的重要问题。语音转换可以作为一种有用的工具,以提供与支持语音系统的人人化相关的新见解。在这项研究中,有两个主要参数被认为是声带结构和俯仰。对于使用线性预测编码(LPC),在两个组件,激励分量和过滤组件中解析转换过程语音。 Ptich由自相关确定。在从源扬声器和目标扬声器获得声学组分之后,然后将声学分量一对一映射以将声学特征从源扬声器替换为目标扬声器。至少,信号由重新交换修改,因此由此产生的语音会感知到目标扬声器的说法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号