A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks

机译：使用神经网络的合成语音自然的分层预测因子

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A problem when developing and tuning speech synthesis systems is that there is no well-established method of automatically rating the quality of the synthetic speech. This research attempts to obtain a new automated measure which is trained on the result of large-scale subjective evaluations employing many human listeners, i.e., the Blizzard Challenge. To exploit the data, we experiment with linear regression, feed-forward and convolutional neural network models, and combinations of them to regress from synthetic speech to the perceptual scores obtained from listeners. The biggest improvements were seen when combining stimulus- and system-level predictions.

机译：开发和调整语音合成系统的问题是，没有自动评估合成语音的质量的既定方法。该研究试图获得新的自动化措施，这些措施培训，这些措施受到许多人类听众的大规模主观评估的结果，即暴风雪挑战。为了利用数据，我们尝试线性回归，前馈和卷积神经网络模型，以及它们与从侦听器获得的感知分数的回归的组合。结合刺激和系统级预测时，可以看到最大的改进。

著录项

来源
《Annual Conference of the International Speech Communication Association》|2016年|744p|共5页
会议地点
作者
Takenori Yoshimura; Gustav Eje Henter; Oliver Watts; Mirjam Wester; Junichi Yamagishi; Keiichi Tokuda;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TB95-53;
关键词
入库时间 2022-08-21 11:41:06

相似文献

外文文献
中文文献
专利

1. Speech Therapy Interface for People with Speech Disorders Using Linear Predictive Coding, Mel Frequency Cepstrum and Neural Networks [J] . Priya S., Suresh A., Vijayalakshmi R. Journal of Medical Imaging and Health Informatics . 2016,第8期

机译：使用线性预测编码，MEL频率谱系和神经网络的语音障碍的言语治疗界面
2. Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Pattern recognition letters . 2017,第octa15期

机译：基于深度神经网络的语音识别和说话人自适应的插件最大后验解码器的分层贝叶斯组合
3. Hierarchical Singleton-Type Recurrent Neural Fuzzy Networks for Noisy Speech Recognition [J] . Juang C.-F., Chiou C.-T., Lai C.-L. IEEE Transactions on Neural Networks . 2007,第3期

机译：分层单例类型递归神经模糊网络用于嘈杂语音识别
4. A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks [C] . Takenori Yoshimura, Gustav Eje Henter, Oliver Watts, Annual Conference of the International Speech Communication Association . 2016

机译：利用神经网络的合成语音自然的分层预测因子
5. Gene expression temporal patterns classification with hierarchical Bayesian neural networks and time lagged recurrent neural networks. [D] . Liang, Yulan. 2003

机译：利用分层贝叶斯神经网络和时滞递归神经网络对基因表达时间模式进行分类。
6. The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility Communication Efficiency and Perceived Naturalness of Synthetic Speech [O] . Jennifer M. Vojtech, Jacob P. Noordzij, Jr., -1

机译：调制基本频率和语音速率对合成语音的可懂度通信效率和感知自然度的影响
7. A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks [O] . Yoshimura, Takenori, Henter, Gustav Eje, Watts, Oliver, 2016

机译：基于神经网络的综合语音自然度的层次预测
8. Hierarchical Neural Network (HNN) for Closed Loop Decision Making: Designing the Architecture of a Hierarchical Neural Network to Model Attention, Learning and Goal Oriented Behavior. [R] . Guez, A. 1990

机译：用于闭环决策的分层神经网络（HNN）：设计层次神经网络的体系结构以模拟注意，学习和目标导向行为。

A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅