首页> 外文会议>International conference on signal processing;ICSP'96 >Building a Speech Database for the Purpose of Speaker Specific Speech Synthesis
【24h】

Building a Speech Database for the Purpose of Speaker Specific Speech Synthesis

机译:建立语音数据库以用于特定于讲话者的语音合成

获取原文

摘要

This paper presents practical and theoretical work carried out in IBM Research Laboratory, during the course of a speech synthesis project. The paper deals with two separate issues. The first is the generation of a compact set of English utterances that will attain a good phonetic coverage of the language. The second issue is constructing a speaker specific database. This starts with the recording of the speaker's speech, modeling it using a highly efficient speech representation and segmenting it into phonemes. The phoneme segmentation process is performed semi-automatically, using an iterative algorithm. A customized software named SPED was developed in order to simplify and speed up the segmentation process and at the same time improve its accuracy.The objective of the methodology presented here is to generate new Voice Fonts" for Text to Speech systems.
机译:本文介绍了语音合成项目过程中在IBM研究实验室中进行的实践和理论工作。该论文涉及两个独立的问题。首先是一代紧凑的英语话语集,这些话语将很好地覆盖该语言。第二个问题是构建演讲者特定的数据库。首先是记录演讲者的语音,然后使用高效的语音表示对其进行建模,然后将其分割为音素。使用迭代算法,半自动执行音素分割过程。开发了名为SPED的定制软件,以简化和加快分割过程,同时提高其准确性。 此处介绍的方法的目的是为“文本到语音”系统生成新的“语音字体”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号