Emotional speech acoustic model for Malay: Iterative versus isolated unit training

Mustafa M.B.; Ainon R.N.

Abstract

The ability of a speech synthesis system to synthesize emotional speech enhances the user's experience of the system and its related applications. However, developing an emotional speech synthesis system is a daunting task given the complexity of human emotional speech. Recent state-of-the-art speech synthesis systems, such as those based on hidden Markov models, can synthesize emotional speech with acceptable naturalness when a good emotional speech acoustic model is available. However, building an emotional speech acoustic model requires adequate resources, including segment-phonetic labels of emotional speech, which is a problem for many under-resourced languages, including Malay. This research shows how an emotional speech acoustic model for Malay can be built with minimal resources. To achieve this objective, two initialization methods were considered: iterative training using the deterministic annealing expectation maximization algorithm, and isolated unit training. The seed model for the automatic segmentation is a neutral speech acoustic model, which was transformed to the target emotion using two transformation techniques: model adaptation and context-dependent boundary refinement. Two forms of evaluation were performed: an objective evaluation measuring the prosody error, and a listening evaluation measuring the naturalness of the synthesized emotional speech.
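
The abstract names the deterministic annealing expectation maximization (DAEM) algorithm without describing it. As a rough illustration only, and not the authors' implementation, the step that distinguishes DAEM from standard EM is a tempered E-step: posteriors are taken proportional to the joint probability raised to a power beta, and beta is raised towards 1 over the course of training. The function name `tempered_posteriors`, the example scores, and the beta schedule below are all hypothetical.

```python
import math


def tempered_posteriors(log_joint, beta):
    """Return posteriors proportional to exp(beta * log_joint[k]).

    log_joint[k] is log(prior_k * likelihood_k) for candidate k (for example,
    one HMM state or one mixture component). beta = 1.0 gives the ordinary EM
    posterior; beta close to 0 flattens it towards a uniform distribution.
    """
    scaled = [beta * lj for lj in log_joint]
    top = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical log joint scores for three competing units at one frame.
log_joint = [-2.0, -4.0, -9.0]

# Illustrative annealing schedule (an assumption): beta rises towards 1.
for beta in (0.1, 0.5, 1.0):
    post = tempered_posteriors(log_joint, beta)
    print(beta, [round(p, 3) for p in post])
```

With a small beta the early alignments stay soft, which is the usual motivation for using annealing when no reliable initial segmentation is available. The M-step is the same as in standard EM and is omitted here.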

Bibliographic details

Source: The Journal of the Acoustical Society of America | 2013, Issue 1 | 10 pages
Authors: Mustafa M.B.; Ainon R.N.
Original format: PDF
Language: English
Keywords: model; Malay; unit training
