LP and TD-PSOLA-based incorporation of happiness in neutral speech using time-domain parameters

机译：使用时域参数在中性语音中基于LP和TD-PSOLA的幸福感融合

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Emotions express a person's internal state of being and it is reflected in the speech utterances. Emotions affect the time-domain characteristics of the speech signal, namely intonation patterns, speech rate, and short-term energy function. Conventional text-to-speech (TTS) systems are built to produce speech utterances for a given text, without any emotion, which can be called as neutral speech. Building a TTS system which can produce speech utterances with expected emotion is not a trivial task, in the sense that for each of the emotions, a separate speech corpus should be carefully collected and the system should be built. Therefore, the current work focuses on incorporating happiness into neutral speech using signal processing algorithms. In this regard, neutral and happy speech are analyzed and it is found that happiness can be perceived in certain emotive words in a sentence. Thus, in order to introduce happiness into neutral speech, these emotive keywords are identified and the above mentioned time-domain parameters are modified. Linear prediction-based synthesis of happy speech is initially performed. To improve the quality of the synthesized speech, TD-PSOLA is then used. Subjective evaluation yields a mean opinion score of 2.05 (out of a maximum of 3) for happy speech synthesized using linear prediction and 2.53 for those synthesized using TD-PSOLA.

机译：情绪表达一个人的内在状态，并在言语表达中得到反映。情绪会影响语音信号的时域特性，即语调模式，语速和短期能量函数。常规的文本语音转换（TTS）系统旨在为给定文本生成语音发声，而不会产生任何情感，这可以称为中性语音。在某种意义上说，构建一个可以产生具有预期情绪的语音发声的TTS系统并不是一件容易的事，因为对于每种情绪，都应该仔细收集一个单独的语音语料，并且应该构建该系统。因此，当前的工作集中在使用信号处理算法将幸福融入中性语音中。在这方面，分析了中性和快乐的言语，发现可以在句子中的某些情感词中感知到快乐。因此，为了将幸福引入中性语音中，识别了这些情感关键词并且修改了上述时域参数。最初执行基于线性预测的快乐语音合成。为了提高合成语音的质量，然后使用TD-PSOLA。对于使用线性预测合成的快乐语音，主观评估得出的平均意见得分为2.05（满分为3），对于使用TD-PSOLA合成的快乐讲话，主观评价得分为2.53。

著录项

来源
《International Conference on Circuit, Power and Computing Technologies》|2014年|1158-1162|共5页
会议地点
作者
Sreenidhi S.; Rachel G. Anushiya; Vijayalakshmi P.; Nagarajan T.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
TD-PSOLA; happiness incorporation; linear prediction; neutral speech; pitch contour; short-term energy;

机译：TD-PSOLA;幸福感;线性预测;中性语音;音高轮廓;短期能量;

相似文献

外文文献
中文文献
专利

1. Efficient time-domain numerical modelling of crosstalk between coaxial cables incorporating frequency-dependent parameters [J] . Teo Yu Xian, Thomas David W. P., Christopoulos Christos Science, Measurement & Technology, IET . 2020,第4期

机译：同轴电缆与频率相关参数之间的串扰的高效时域数值模型
2. Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals [J] . Pravena D., Govind D. International journal of speech technology . 2017,第4期

机译：合并激励源参数对于改善语音和电声图信号的情感识别的意义
3. Linear trajectory models incorporating preprocessing parameters for speech recognition [J] . Chengalvarayan R. IEEE signal processing letters . 1998,第3期

机译：结合了用于语音识别的预处理参数的线性轨迹模型
4. LP and TD-PSOLA-based incorporation of happiness in neutral speech using time-domain parameters [C] . Sreenidhi S., Rachel G. Anushiya, Vijayalakshmi P., International Conference on Circuit, Power and Computing Technologies . 2014

机译：LP和TD-PSOLA的幸福在中立语音中的幸福结合使用时域参数
5. Study of neutral D meson - neutral anti-D meson mixing parameters using a time-dependent amplitude analysis of the decay neutral D meson going to neutral K(S) meson-pion-antipion. [D] . Andreassen, Rolf. 2010

机译：研究中性D介子-中性抗D介子的混合参数，使用随时间变化的振幅分析，分析中性D介子进入中性K（S）介子-对等离子的衰减。
6. Acoustic realization of Mandarin neutral tone and tone sandhi in infant-directed speech and Lombard speech [O] . Ping Tang, Nan Xu Rattanasone, Ivan Yuen, -1

机译：婴幼儿语音和伦巴德语音中普通话中性语调和语音变调的声学实现
7. A preliminary study of an audio-visual speech coder: using video parameters to reduce an LPC vocoder bit rate [O] . Foucher Elodie, Feng Gang, Girin Laurent 1998

机译：视听语音编码器的初步研究：使用视频参数降低LPC声码器的比特率
8. Incorporation of Atmospheric Flow Fields and Ground Interactions into Acoustic Finite-Difference, Time-Domain Simulations [R] . Wilson, D. K. , Marlin, D. H. , Collier, S. L. , 2004

机译：将大气流场和地面相互作用结合到声学有限差分，时域模拟中

LP and TD-PSOLA-based incorporation of happiness in neutral speech using time-domain parameters

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅