Statistical Synthesizer with Embedded Prosodic and Spectral Modifications to Generate Highly Intelligible Speech in Noise



Abstract

This paper describes a statistical parametric speech synthesizer that, despite having been trained on an ordinary synthesis database and without any adaptation data, is able to generate highly intelligible speech in noisy environments. By using a simple and flexible vocoder based on a harmonic model, it applies several noise-independent modifications to durations, pitch level and range, energy contour, formant sharpness, and intensity of particular spectral bands. The system has been evaluated by means of a large subjective test, the results of which show that the suggested approach clearly outperforms the reference TTS systems and even unmodified natural speech in some conditions.
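As a rough illustration of two of the modifications the abstract lists (pitch level and range, and the intensity of particular spectral bands), the sketch below operates on a frame-wise F0 contour and on harmonic amplitudes in dB. It is not the authors' implementation: the function names, the semitone shift, the range scale, the band limits, and the gain are all hypothetical values chosen only for the example.

```python
import numpy as np

def modify_pitch(f0_hz, level_shift_semitones=2.0, range_scale=1.3):
    """Raise the pitch level and widen the pitch range of voiced frames.

    Frames with f0 <= 0 are treated as unvoiced and left at 0.
    Both parameter defaults are hypothetical, not values from the paper.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    out = np.zeros_like(f0)
    voiced = f0 > 0
    if not voiced.any():
        return out
    log_f0 = np.log2(f0[voiced])
    mean = log_f0.mean()
    # Scale deviations around the mean (range), then shift the mean (level).
    new_log_f0 = mean + range_scale * (log_f0 - mean) + level_shift_semitones / 12.0
    out[voiced] = 2.0 ** new_log_f0
    return out

def boost_band(harmonic_amps_db, harmonic_freqs_hz,
               lo_hz=1000.0, hi_hz=4000.0, gain_db=6.0):
    """Add gain_db to every harmonic whose frequency lies in [lo_hz, hi_hz].

    The band limits and gain are hypothetical placeholders.
    """
    amps = np.asarray(harmonic_amps_db, dtype=float).copy()
    freqs = np.asarray(harmonic_freqs_hz, dtype=float)
    in_band = (freqs >= lo_hz) & (freqs <= hi_hz)
    amps[in_band] += gain_db
    return amps

# Usage: a toy contour with two voiced frames between unvoiced ones.
new_f0 = modify_pitch([0.0, 100.0, 200.0, 0.0])
new_amps = boost_band([0.0, 0.0, 0.0], [500.0, 2000.0, 5000.0])
```

Working in the log-frequency domain keeps the level shift and the range scaling independent of each other, which is why the sketch applies them to `log2(f0)` rather than to F0 in Hz.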


Bibliographic record

Source
Conference of the International Speech Communication Association, 2013, 5 pages
Authors
D. Erro; T.C. Zorilă; Y. Stylianou; E. Navas; I. Hernaez
Original format: PDF
Chinese Library Classification: TN912.3-532
Keywords
Noise; Synthesizer; adaptation


Similar literature


1. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise [J]. Cassia VALENTINI-BOTINHAO, Junichi YAMAGISHI, Simon KING. 電子情報通信学会技術研究報告. 音声. Speech. 2013, No. 76
2. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise [J]. Cassia VALENTINI-BOTINHAO, Junichi YAMAGISHI, Simon KING. 電子情報通信学会技術研究報告. 福祉情報工学. Welfare Information Technology. 2013, No. 77
3. LOW FOOTPRINT HIGH INTELLIGIBILITY MALAY SPEECH SYNTHESIZER BASED ON STATISTICAL DATA [J]. Lau Chee Yong, Tan Tian Swee. Journal of computer sciences. 2014, No. 2
4. Statistical Synthesizer with Embedded Prosodic and Spectral Modifications to Generate Highly Intelligible Speech in Noise [C]. D. Erro, T.C. Zorilă, Y. Stylianou. Conference of the International Speech Communication Association. 2013
5. Prosody, intelligibility and familiarity in speech perception [D]. McCloy, Daniel Robert. 2013
6. The intelligibility of noise-vocoded speech: spectral information available from across-channel comparison of amplitude envelopes [O]. Brian Roberts, Robert J. Summers, Peter J. Bailey. 2011
7. LOW FOOTPRINT HIGH INTELLIGIBILITY MALAY SPEECH SYNTHESIZER BASED ON STATISTICAL DATA [O]. Lau Chee Yong, Tan Tian Swee. 2015
8. Prosodic Stress, Information, and Intelligibility of Speech in Noise [R]. Divenyi, P. L. 2009
