首页> 外文会议>International Conference on Digital Futures and Transformative Technologies >Speech Quality Assessment using Mel Frequency Spectrograms of Speech Signals
【24h】

Speech Quality Assessment using Mel Frequency Spectrograms of Speech Signals

机译:语音信号的语音质量评估语音信号

获取原文

摘要

Non-intrusive speech quality assessment (NI-SQA) has gained importance, due to recent advancements in multimedia, signal processing, machine learning, speech communication, and automatic speech recognition. The performance of NI-SQA techniques highly dependent on the extracted features to predict speech quality. In this article, a new machine learning-based method is proposed for predicting speech quality, without using reference signals is proposed. Traditional techniques used in literature cannot be implemented in practical application scenarios due to less correlation accuracy between subjective and objective scores. In this work, we used Mel-frequency cepstral coefficients (MFCCs) for predicting speech quality that is degraded in different noise conditions. We have computed the proposed work results on two independent databases. Experimental results show significant improvement in the performance when compared with current approaches for assessment of speech quality.
机译:由于近期多媒体,信号处理,机器学习,语音通信和自动语音识别,非侵入式语音质量评估(NI-SQA)已获得重要性。 NI-SQA技术的性能高度依赖于提取的特征来预测语音质量。 在本文中,提出了一种新的基于机器学习的方法,用于预测语音质量,而不使用参考信号。 由于主观和客观分数之间的相关精度较小,文学中使用的传统技术不能在实际应用方案中实现。 在这项工作中,我们使用熔融频率谱系数(MFCC)来预测在不同噪声条件下降低的语音质量。 我们计算了两个独立数据库的建议的工作结果。 实验结果表明,与当前评估语音质量的方法相比,性能显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号