An evaluation of automatic phoneme segmentation for concatenative speech synthesis

Hisashi Kawai; Tomoki Toda


Abstract

This paper studies the performance of automatic phoneme segmentation from three viewpoints: (1) temporal precision, (2) effects on segment selection, and (3) effects on the naturalness of synthetic speech. The absolute errors of the phoneme beginning time for the best 90% and the worst 10% were 4.6 ms and 25.9 ms, respectively, which are comparable to discrepancies among human labelers. Our segment selection algorithm was found to be able to eliminate waveform segments with large temporal errors, although not perfectly. In a perception test in which naturalness was pair-compared between synthetic speech generated from hand-labeled data and from auto-labeled data, the difference was found to be marginal in practice, although the latter is statistically inferior.
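The temporal-precision figures above can be illustrated with a minimal sketch. The abstract does not say how the 90%/10% split was computed, so this assumes the absolute boundary errors are sorted and then averaged separately over the smallest 90% and the largest 10%. The function name `split_boundary_errors`, its parameters, and the sample times are all hypothetical and are not taken from the paper.

```python
from statistics import mean


def split_boundary_errors(auto_ms, hand_ms, best_fraction=0.9):
    """Return (mean abs error of the best fraction, mean abs error of the rest).

    auto_ms and hand_ms are phoneme beginning times in milliseconds from
    automatic and hand labeling, aligned index by index (an assumption).
    """
    if len(auto_ms) != len(hand_ms) or len(auto_ms) < 2:
        raise ValueError("need two equal-length lists with at least 2 boundaries")
    # Absolute timing error per boundary, smallest first.
    errors = sorted(abs(a - h) for a, h in zip(auto_ms, hand_ms))
    # Index separating the "best" group from the "worst" group;
    # clamp so that neither group is empty.
    cut = round(len(errors) * best_fraction)
    cut = max(1, min(cut, len(errors) - 1))
    return mean(errors[:cut]), mean(errors[cut:])


# Hypothetical data: ten boundaries, one of them badly misplaced.
hand = [0.0, 100.0, 200.0, 300.0, 400.0, 500.0, 600.0, 700.0, 800.0, 900.0]
offsets = [1, -2, 3, -4, 5, -6, 7, -8, 9, -30]
auto = [h + o for h, o in zip(hand, offsets)]

best, worst = split_boundary_errors(auto, hand)
print(f"best 90%: {best:.1f} ms, worst 10%: {worst:.1f} ms")
```

Reporting the two groups separately, rather than a single overall mean, keeps a small number of gross misalignments from being hidden by the many well-placed boundaries.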


Bibliographic details

Source: 《電子情報通信学会技術研究報告. 音声. Speech》, 2002, No. 619, 6 pages
Authors: Hisashi Kawai; Tomoki Toda
Original format: PDF
Text language: Japanese (jpn)
Chinese Library Classification: Telegraphy, facsimile
Keywords: Phoneme segmentation; Speech synthesis; Corpus-base; Segment selection

