A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units

Hiroyuki SEGI; Tohru TAKAGI

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units

【24h】

A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units

机译：一种基于上下文的变长音素序列作为搜索单元的语音合成方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A concatenative speech synthesis method using words and clustered triphones as search units has been proposed. Only 20% words in the sentences, however, satisfy the phoneme context as well as enough quantity in the speech database although most of the words in the sentences can be found in the speech database. So in this paper, we try to use context dependent phoneme sequences not bound by words and to concatenate the speech segments satisfying phoneme context. We produce synthesized speech by proposed method using four different size speech databases. The subjective evaluation on then-naturalness shows that 1) mean opinion score is 3.6, 2) enlargement of speech database more than 43.2 hours doesn't contribute the improvement in naturalness 3) the processing time of speech synthesis shows a little increase with enlargement of speech database.

机译：提出了一种以词和簇三音为搜索单元的级联语音合成方法。尽管句子中的大多数单词都可以在语音数据库中找到，但是句子中只有20％的单词可以满足音素上下文以及语音数据库中足够的数量。因此，在本文中，我们尝试使用不受单词限制的上下文相关音素序列，并连接满足音素上下文的语音段。我们通过使用四个不同大小的语音数据库的建议方法来生成合成语音。对自然的主观评价表明：1）平均意见得分为3.6，2）语音数据库的扩大超过43.2小时没有对自然性的改善做出贡献3）语音合成的处理时间随的扩大而略有增加语音数据库。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2003年第264期|共6页
作者
Hiroyuki SEGI; Tohru TAKAGI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类电报、传真;
关键词
Speech Synthesis; Concatenation; Phoneme Sequences; Context Dependent; Database; Corpus;

机译：语音合成;串联;音素序列;上下文相关;数据库;Corpus;

相似文献

外文文献
中文文献
专利

1. A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units [J] . Hiroyuki SEGI, Tohru TAKAGI 電子情報通信学会技術研究報告. 音声. Speech . 2003,第264期

机译：一种基于上下文的变长音素序列作为搜索单元的语音合成方法
2. Fast Concatenative Speech Synthesis Using Pre-Fused Speech Units Based on the Plural Unit Selection and Fusion Method [J] . Masatsune TAMURA, Tatsuya MIZUTANI, Takehiko KAGOSHIMA IEICE Transactions on Information and Systems . 2007,第2期

机译：基于多个单元选择和融合方法的预融合语音单元快速级联语音合成
3. Accurate visible speech synthesis based on concatenating variable length motion capture data [J] . Ma J., Cole R., Pellom B., IEEE transactions on visualization and computer graphics . 2006,第2期

机译：基于级联可变长度运动捕获数据的准确可见语音合成
4. Modeling variable length phoneme sequences — A step towards linguistic information for speech emotion recognition in wider world [C] . Kalani Wataraka Gamage, Vidhyasaharan Sethu, Eliathamby Ambikairajah International Conference on Affective Computing and Intelligent Interaction . 2017

机译：可变长度音素序列建模-迈向更广泛世界中用于语音情感识别的语言信息的一步
5. Advances in speaker-dependent concatenative speech synthesis. [D] . Chappell, David Thomas. 2000

机译：说话者相关的级联语音合成技术的进步。
6. Comparison of a Semiautomated Commercial Repetitive-Sequence-Based PCR Method with Spoligotyping 24-Locus Mycobacterial Interspersed Repetitive-Unit–Variable-Number Tandem-Repeat Typing and Restriction Fragment Length Polymorphism-Based Analysis of IS6110 for Mycobacterium tuberculosis Typing [O] . F. Brossier, C. Sola, G. Millot, 2014

机译：半自动化的基于商业重复序列的PCR方法与寡核苷酸分型24位点分枝杆菌重复单元-可变数串联重复序列分型和基于限制性片段长度多态性的IS6110结核分枝杆菌分型分析的比较
7. Accurate visible speech synthesis based on concatenating variable length motion capture data [O] . Jiyong Ma, Bryan Pellom, Wayne Ward, 2006

机译：基于级联可变长度运动捕获数据的准确可见语音合成

A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units

摘要

著录项

相似文献

相关主题

期刊订阅