A Text-to-Speech Platform for Variable Length Optimal Unit Searching Using Perception Based Cost Functions

MINKYU LEE; DANIEL P. LOPRESTI; JOSEPH P. OLIVE

首页> 外文期刊>International journal of speech technology >A Text-to-Speech Platform for Variable Length Optimal Unit Searching Using Perception Based Cost Functions

【24h】

A Text-to-Speech Platform for Variable Length Optimal Unit Searching Using Perception Based Cost Functions

机译：基于感知的成本函数的变长最佳单位搜索的文本语音平台

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In concatenative Text-to-Speech, the size of the speech corpus is closely related to synthetic speech quality. In this paper, we describe our work on a new corpus-based Bell Labs' TTS system. This encompasses large acoustic inventories with a rich set of annotations, models and data structures for representing and managing such inventories, and an optimal unit selection algorithm that accommodates a broad range of possible cost criteria. We also propose a new method for setting weights in the cost functions based on a perceptual preference test. Our results show that this approach can successfully predict human preference patterns. Synthetic speech using weights determined in this manner consistently demonstrates smoother transitions and higher voice quality than speech using manually set weights.

机译：在串联文本到语音中，语音语料库的大小与合成语音质量密切相关。在本文中，我们描述了基于新语料库的Bell Labs TTS系统的工作。这包括具有丰富注释，模型和数据结构的大量声学清单，用于表示和管理此类清单，以及可适应各种可能成本标准的最佳单位选择算法。我们还提出了一种基于感知偏好测试在成本函数中设置权重的新方法。我们的结果表明，这种方法可以成功预测人的偏好模式。与使用手动设置的权重的语音相比，使用以这种方式确定的权重的合成语音始终显示出更平滑的过渡和更高的语音质量。

著录项

来源
《International journal of speech technology》 |2003年第4期|p.347-356|共10页
作者
MINKYU LEE; DANIEL P. LOPRESTI; JOSEPH P. OLIVE;
展开▼
作者单位

Bell Labs, Lucent Technologies, 600 Mountain Avenue, Murray Hill, NJ 07974, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
text-to-speech; unit selection; viterbi searching;

机译：文本到语音;单位选择;维特比搜索;

相似文献

外文文献
中文文献
专利

1. Optimal weight tuning method for unit selection cost functions in syllable based text-to-speech synthesis [J] . N. P. Narendra, K. Sreenivasa Rao Applied Soft Computing . 2013,第2期

机译：基于音节的语音合成中单位选择成本函数的最优权重调整方法
2. Evolutionary Programming Based Optimal Power Flow for Units with Non-Smooth Fuel Cost Functions [J] . R. GNANADASS, P. VENKATESH, NARAYANA PRASAD PADHY Electric Power Components and Systems . 2005,第3期

机译：具有非光滑燃料成本函数的机组基于进化规划的最优潮流
3. On the Construction of Boolean Functions with Optimal Algebraic Immunity Based on Factorization of Numbers of Variables [J] . Huajin CHEN, Wenfeng QI, Chuangui MA IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2013,第1期

机译：基于变量数分解的最优代数免疫布尔函数的构造
4. Perceptual Cost Functions for Unit Searching in Large Corpus-based Concatenative Text-to-Speech [C] . Minkyu Lee European conference on speech communication and technology . 2001

机译：基于语料库的基于语料库的连接文本到语音的单位搜索的感知成本函数
5. The effect of clinical pathways in reducing hospital length of stay and hospital costs and improving functional outcomes in total hip and knee arthroplasty patients: A systematic review. [D] . Patel, Hiren M. 2008

机译：临床途径在减少全髋和膝关节置换患者的住院时间和住院费用以及改善功能结局方面的效果：系统评价。
6. INTERVAL (investigation of NICE technologies for enabling risk-variable-adjusted-length) dental recalls trial: a multicentre randomised controlled trial investigating the best dental recall interval for optimum cost-effective maintenance of oral health in dentate adults attending dental primary care [O] . Jan E. Clarkson, Nigel B. Pitts, Debbie Bonetti, 2018

机译：INTERVAL（用于实现风险可变长度可变长度的NICE技术的调查）牙科召回试验：一项多中心随机对照试验研究最佳牙科召回间隔以最佳经济有效地保持牙科初级保健就诊的成年成年人口腔健康
7. VARIABLE-LENGTH UNIT SELECTION USING LSA-BASED SYNTACTIC STRUCTURE COST [O] . Chung-hsien Wu, Chi-chun Hsia, Jiun-fu Chen, 2014

机译：基于Lsa的合成结构成本选择变长单元

A Text-to-Speech Platform for Variable Length Optimal Unit Searching Using Perception Based Cost Functions

摘要

著录项

相似文献

相关主题

期刊订阅