首页> 外文会议>International Conference on Quality of Multimedia Experience >Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders
【24h】

Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders

机译:传统和基于神经网络的低码率声码器的语音质量因子

获取原文

摘要

This study compares the performances of different algorithms for coding speech at low bit rates. In addition to widely deployed traditional vocoders, a selection of recently developed generative-model-based coders at different bit rates are contrasted. Performance analysis of the coded speech is evaluated for different quality aspects: accuracy of pitch periods estimation, the word error rates for automatic speech recognition, and the influence of speaker gender and coding delays. A number of performance metrics of speech samples taken from a publicly available database were compared with subjective scores. Results from subjective quality assessment do not correlate well with existing full reference speech quality metrics. The results provide valuable insights into aspects of the speech signal that will be used to develop a novel metric to accurately predict speech quality from generative-model-based coders.
机译:这项研究比较了不同算法在低比特率下编码语音的性能。除了广泛部署的传统声码器以外,还对最近开发的基于生成模型的不同比特率编码器的选择进行了对比。针对不同质量方面评估了编码语音的性能分析:基音周期估计的准确性,用于自动语音识别的单词错误率以及说话者性别和编码延迟的影响。从公共数据库中获取的语音样本的许多性能指标与主观得分进行了比较。主观质量评估的结果与现有的完整参考语音质量度量标准并没有很好的相关性。结果为语音信号的各个方面提供了宝贵的见解,这些语音信号将用于开发一种新颖的度量标准,以准确地预测基于生成模型的编码器的语音质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号