Psychoacoustic Segment Scoring for Multi-Form Speech Synthesis

Abstract

In multi-form segment synthesis, output speech is constructed by splicing waveform segments with statistically modeled and regenerated parametric speech segments. The fraction of model-derived segments is called the model-template ratio. The motivation of this work is to further increase the flexibility of multi-form synthesis while maintaining high speech quality at high model-template ratios. An approach is presented in which the representation type of a segment is selected per acoustic leaf. We introduce a novel method for leaf representation selection based on a psychoacoustic segment stationarity score. Additionally, we present refinements in multi-form segment concatenation for high model-template ratio synthesis, including boundary-constrained statistical parametric synthesis and time-domain alignment based on multi-peak analysis of cross-correlation.

Bibliographic details

Source: INTERSPEECH 2012 | 2012 | 4 pages
Authors: Alexander Sorin; Slava Shechtman; Vincent Pollet
Original format: PDF
Chinese Library Classification: 73.4136083
Keywords: speech synthesis; multi-form segments; speech stationarity; psychoacoustic segment scoring; statistical parametric synthesis; segment concatenation

Similar literature

1. Encoding Navigable Speech Sources: A Psychoacoustic-Based Analysis-by-Synthesis Approach [J]. Zheng X., Ritz C., Xi J. IEEE Transactions on Audio, Speech, and Language Processing. 2013, No. 1
2. Optimizing phase dispersion for excitation source in speech synthesis with STRAIGHT: psychoacoustical evaluation and optimization of control parameters [J]. Hideki Iwasawa, Minoru Tsuzaki, Hisashi Kawai. 電子情報通信学会技術研究報告. 信号処理. Signal Processing. 2001, No. 323
3. Optimizing phase dispersion for excitation source in speech synthesis with STRAIGHT: psychoacoustical evaluation and optimization of control parameters [J]. Hideki Iwasawa, Minoru Tsuzaki, Hisashi Kawai. 電子情報通信学会技術研究報告. 音声. Speech. 2001, No. 325
4. Psychoacoustic Segment Scoring for Multi-Form Speech Synthesis [C]. Alexander Sorin, Slava Shechtman, Vincent Pollet. Annual Conference of the International Speech Communication Association. 2012
5. Evaluation of Speech Perception and Psychoacoustic Abilities Following Chemotherapy [D]. Kappes, Melissa Skarl. 2018
6. Clinical psychoacoustics in Alzheimer's disease central auditory processing disorders and speech deterioration [O]. Vassiliki Iliadou, Stergios Kaprinis. 2003
7. Cyborg Speech: Deep Multilingual Speech Synthesis for Generating Segmental Foreign Accent with Natural Prosody [O]. Gustav Eje Henter, Jaime Lorenzo-Trueba, Xin Wang, 2018
8. Part I: Segmentation Techniques in Speech Synthesis; Part II: A Segment Inventory for Speech Synthesis [R]. Gordon E. Peterson, William S-Y Wang. 1958
