Building a Speech Database for the Purpose of Speaker Specific Speech Synthesis

机译：建立语音数据库以用于特定于讲话者的语音合成

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents practical and theoretical work carried out in IBM Research Laboratory, during the course of a speech synthesis project. The paper deals with two separate issues. The first is the generation of a compact set of English utterances that will attain a good phonetic coverage of the language. The second issue is constructing a speaker specific database. This starts with the recording of the speaker's speech, modeling it using a highly efficient speech representation and segmenting it into phonemes. The phoneme segmentation process is performed semi-automatically, using an iterative algorithm. A customized software named SPED was developed in order to simplify and speed up the segmentation process and at the same time improve its accuracy.The objective of the methodology presented here is to generate new Voice Fonts" for Text to Speech systems.

机译：本文介绍了语音合成项目过程中在IBM研究实验室中进行的实践和理论工作。该论文涉及两个独立的问题。首先是一代紧凑的英语话语集，这些话语将很好地覆盖该语言。第二个问题是构建演讲者特定的数据库。首先是记录演讲者的语音，然后使用高效的语音表示对其进行建模，然后将其分割为音素。使用迭代算法，半自动执行音素分割过程。开发了名为SPED的定制软件，以简化和加快分割过程，同时提高其准确性。此处介绍的方法的目的是为“文本到语音”系统生成新的“语音字体”。

著录项

来源
《International conference on signal processing;ICSP'96》|1996年|741-744|共4页
会议地点
作者
R. Hoory; N. Shaked; D. Chazan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN911.;
关键词

相似文献

外文文献
中文文献
专利

1. Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System [J] . Dhanalakshmi M., Celin T. A. Mariya, Nagarajan T., Circuits, systems, and signal processing . 2018,第2期

机译：基于HMM的语音识别和自适应合成系统的韵律演讲者的语音输入语音输出通信
2. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis [J] . John Dines, Hui Liang, Lakshmi Saheer, Computer speech and language . 2013,第2期

机译：个性化语音到语音翻译：基于HMM的语音合成的无监督跨语言说话者自适应
3. Speaker clustering using telephone speech database of a large number of speakers [J] . Tsuneo Kato, Shingo Kuroiwa, Tohru Shimizu, 電子情報通信学会技術研究報告. 音声. Speech . 2000,第136期

机译：使用大量演讲者的电话语音数据库进行演讲者聚类
4. Building a speech database for the purpose of speaker specific speech synthesis [C] . Hoory, R., Shaked, . 1996

机译：建立语音数据库以实现特定于说话者的语音合成
5. Building a prosodically sensitive diphone database for a Korean text-to-speech synthesis system. [D] . Yoon, Kyuchul. 2005

机译：为韩国文字转语音合成系统建立一个对韵律敏感的diphone数据库。
6. Speech-on-speech masking with variable access to the linguistic content of the masker speech for native and non-native speakers of English [O] . Lauren Calandruccio, Ann R. Bradlow, Sumitrajit Dhar -1

机译：语音对语音的掩盖功能可为母语为英语和非母语的英语使用者提供对掩蔽者语音的语言内容的可变访问权限
7. Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation [O] . Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, 2011

机译：语音转换中基于特征语音转换和语言相关韵律转换的说话人自适应语音合成

Building a Speech Database for the Purpose of Speaker Specific Speech Synthesis

摘要

著录项

相似文献

相关主题

期刊订阅