Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

Suphattharachai Chomphan; Chutarat Chompunth

首页> 外文期刊>American journal of applied sciences >Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

【24h】

Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

机译：改善基于平均语音的泰语语音合成的音质清晰度

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Problem statement: Tone intelligibility in speech synthesis is an important attribute that should be taken into account. The tone correctness of the synthetic speech is degraded considerably in the average-voice-based HMM-based Thai speech synthesis. The tying mechanism in the decision tree based context clustering without appropriate criterion causes unexpected tone neutralization. Incorporation of the phrase intonation to the context clustering process in the training stage was proposed early. However, the tone correctness is not satisfied. Approach: This study proposes a number of tonal features including tone-geometrical features and phrase intonation features to be exploited in the context clustering process of HMM training stage. Results: In the experiments, subjective evaluations of both average voice and adapted voice in terms of the intelligibility of tone are conducted. Effects on decision trees of the extracted features are also evaluated. By considering gender in training speech, two core experiments were conducted. The first experiment shows that the proposed tonal features can improve the tone intelligibility for female speech model above that of male speech model, while the second experiment shows that the proposed tonal features give the better improvement of the tone intelligibility for gender dependent model than for gender independent model. Conclusion: All of the experimental results confirm that the tone correctness of the synthesized speech from the average-voice-based HMM-based Thai speech synthesis is significantly improved when using most of the extracted features.

机译：问题陈述：语音合成中的语音清晰度是应考虑的重要属性。在基于平均语音的基于HMM的泰语语音合成中，合成语音的音调正确性大大降低。没有适当条件的基于决策树的上下文聚类中的绑定机制会导致意外的色调中和。提早在训练阶段将短语语调并入上下文聚类过程中。但是，不满足色调正确性。方法：本研究提出了许多音调特征，包括音调几何特征和短语语调特征，这些特征将在HMM训练阶段的上下文聚类过程中加以利用。结果：在实验中，对平均声音和适应声音的主观评价均基于音调的清晰度进行。还评估了提取特征对决策树的影响。通过在培训演讲中考虑性别，进行了两个核心实验。第一个实验表明，所提出的音调特征可以提高女性言语模型的音调清晰度，而第二个实验表明，所提出的音调特征对于性别依赖性模型的音调清晰度要比对性别的模型更好。独立模型。结论：所有实验结果都证实，使用大多数提取的特征时，基于平均语音的基于HMM的泰语语音合成的合成语音的正确性得到了显着提高。

著录项

来源
《American journal of applied sciences》 |2012年第3期|p.358-364|共7页
作者
Suphattharachai Chomphan; Chutarat Chompunth;
展开▼
作者单位

Department of Electrical Engineering,Faculty of Engineering at Si Racha,Kasetsart University, 199 M.6,Tungsukhla, Si Racha, Chonburi, 20230, Thailand;

School of Social and Environmental Development,National Institute of Development Administration, 118 M.3, Serithai Road,Klong-Chan, Bangkapi, Bangkok, 10240, Thailand;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
thai speech; speech synthesis; tone intelligibility; tone correctness; generative model; context clustering; average voice; hidden markov models;

机译：泰语语音合成音调清晰度音调正确;生成模型上下文聚类;平均声音;隐藏的马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis | Science Publications [J] . Chutarat Chompunth, Suphattharachai Chomphan American journal of applied sciences . 2012,第3期

机译：基于平均语音的泰语语音合成的音质清晰度改善科学出版物
2. A Context Clustering Technique for Improvement of Tone Intelligibility of Average-voice-based Thai Speech Synthesis [J] . Suphattharachai Chomphan, Takao Kobayashi 電子情報通信学会技術研究報告 . 2008,第551期

机译：一种提高基于平均声音的泰语语音合成音质清晰度的上下文聚类技术
3. A Context Clustering Technique for Improvement of Tone Intelligibility of Average-voice-based Thai Speech Synthesis [J] . Suphattharachai Chomphan, Takao Kobayashi 電子情報通信学会技術研究報告. 音声. Speech . 2007,第551期

机译：一种提高基于平均语音的泰语语音合成音质清晰度的上下文聚类技术
4. Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis [C] . Chunwijitra Vataya, Nose Takashi, Kobayashi Takao 2011 IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：在基于平均语音的语音合成中使用量化的F0符号进行音调上下文标记以提高音调正确性
5. Tone classification of syllable-segmented Thai speech based on multilayer perceptron. [D] . Satravaha, Nuttavudh. 2002

机译：基于多层感知器的音节段泰语语音的音调分类。
6. The Binaural Masking-Level Difference of Mandarin Tone Detection and the Binaural Intelligibility-Level Difference of Mandarin Tone Recognition in the Presence of Speech-Spectrum Noise [O] . Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang, -1

机译：语音频谱噪声下普通话检测的双耳掩蔽水平差异和普通话识别的双耳可懂度水平差异
7. Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis [O] . Suphattharachai Chomphan, Chutarat Chompunth 2012

机译：改善基于平均语音的泰语语音合成的音质清晰度
8. Effect of Tone/Noise Combination on Speech Intelligibility. [R] . Pearsons, K. S. 1976

机译：音/噪声组合对语音清晰度的影响。

Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis

摘要

著录项

相似文献

相关主题

期刊订阅