首页> 外文会议>European Conference on Speech Communication and Technology - EUROSPEECH 2003(INTERSPEECH 2003) vol.1; 20030901-04; Geneva(CH) >Predicting the Perceptive Judgment of Voices in a Telecom Context: Selection of Acoustic Parameters
【24h】

Predicting the Perceptive Judgment of Voices in a Telecom Context: Selection of Acoustic Parameters

机译:在电信环境中预测语音的感知判断:声学参数的选择

获取原文
获取原文并翻译 | 示例

摘要

Perception of vocal styles is of paramount importance in vocal server application as the global style of a telecom service is highly dependant on the voice used. In this work we develop tools for automatic inference of perceived vocal styles for a set of 100 vocal sequences. In a first stage, twenty subjective evaluation criteria have been identified by running perceptive experiments with naive listeners. In a second stage, the vocal sequences have been parameterised using more than a hundred acoustic features representing prosody, spectral energy distribution, articulation and waveform. Then, regression analysis and neural networks are used for predicting the subjective score of each voice for each subjective criterion. The results show that the prediction error is generally low: it seems possible to predict automatically the perceived quality of the sequences. Moreover, the prediction error decreases when non-significant parameters are removed.
机译:对声音风格的感知在声音服务器应用程序中至关重要,因为电信服务的全局风格高度依赖于所使用的声音。在这项工作中,我们开发了用于自动推断一组100个声音序列的感知声音样式的工具。在第一阶段,通过对幼稚的听众进行感知实验,确定了二十个主观评估标准。在第二阶段,使用一百多个代表韵律,频谱能量分布,清晰度和波形的声学特征对人声序列进行参数化。然后,使用回归分析和神经网络来预测每个主观标准的每个声音的主观得分。结果表明,预测误差通常较低:似乎可以自动预测序列的感知质量。而且,当去除不重要的参数时,预测误差减小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号