Speech Quality Assessment using Mel Frequency Spectrograms of Speech Signals

机译：语音信号的语音质量评估语音信号

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Non-intrusive speech quality assessment (NI-SQA) has gained importance, due to recent advancements in multimedia, signal processing, machine learning, speech communication, and automatic speech recognition. The performance of NI-SQA techniques highly dependent on the extracted features to predict speech quality. In this article, a new machine learning-based method is proposed for predicting speech quality, without using reference signals is proposed. Traditional techniques used in literature cannot be implemented in practical application scenarios due to less correlation accuracy between subjective and objective scores. In this work, we used Mel-frequency cepstral coefficients (MFCCs) for predicting speech quality that is degraded in different noise conditions. We have computed the proposed work results on two independent databases. Experimental results show significant improvement in the performance when compared with current approaches for assessment of speech quality.

机译：由于近期多媒体，信号处理，机器学习，语音通信和自动语音识别，非侵入式语音质量评估（NI-SQA）已获得重要性。 NI-SQA技术的性能高度依赖于提取的特征来预测语音质量。在本文中，提出了一种新的基于机器学习的方法，用于预测语音质量，而不使用参考信号。由于主观和客观分数之间的相关精度较小，文学中使用的传统技术不能在实际应用方案中实现。在这项工作中，我们使用熔融频率谱系数（MFCC）来预测在不同噪声条件下降低的语音质量。我们计算了两个独立数据库的建议的工作结果。实验结果表明，与当前评估语音质量的方法相比，性能显着提高。

著录项

来源
《International Conference on Digital Futures and Transformative Technologies》|2021年|1-5|共5页
会议地点
作者
Shakeel Zafar; Imran Fareed Nizami; Muhammad Majid;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Support vector machines; Oral communication; Machine learning; Feature extraction; Quality assessment; Multimedia communication; Mel frequency cepstral coefficient;

机译：支持向量机;口头通信;机器学习;特征提取;质量评估;多媒体通信;麦倍频跳跃系数;

相似文献

外文文献
中文文献
专利

1. Principal component analysis of the spectrogram of the speech signal: Interpretation and application to dysarthric speech [J] . Kacha Abdellah, Grenez Francis, Rafael Orozco-Arroyave Juan, Computer speech and language . 2020,第Jana期

机译：语音信号频谱图的主成分分析：解构语音的解释和应用
2. Speech Therapy Interface for People with Speech Disorders Using Linear Predictive Coding, Mel Frequency Cepstrum and Neural Networks [J] . Priya S., Suresh A., Vijayalakshmi R. Journal of Medical Imaging and Health Informatics . 2016,第8期

机译：使用线性预测编码，MEL频率谱系和神经网络的语音障碍的言语治疗界面
3. Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures [J] . Darch J, Milner B, Vaseghi S The Journal of the Acoustical Society of America . 2008,第6期

机译：分布式语音识别架构中基于mel-频率倒谱系数的声学语音特征分析和预测
4. Dataset of Raw and Pre-processed Speech Signals, Mel Frequency Cepstral Coefficients of Speech and Heart Rate Measurements [C] . Mohammed Usman, Zeeshan Ahmad, Mohd Wajid International Conference on Signal Processing, Computing and Control . 2019

机译：原始和预处理语音信号的数据集，语音的梅尔频率倒谱系数和心率测量
5. Development of a speech recognition system using the Mel Frequency Cepstrum Coefficient method. [D] . Mahajan, Mayur. 2016

机译：使用梅尔频率倒谱系数方法开发语音识别系统。
6. Quality ratings of frequency-compressed speech by participants with extensive high-frequency dead regions in the cochlea [O] . Marina Salorio-Corbetto, Thomas Baer, Brian C. J. Moore -1

机译：耳蜗中具有大量高频死区的参与者对频率压缩语音的质量评级
7. Dataset of Raw and Pre-processed Speech Signals, Mel Frequency Cepstral Coefficients of Speech and Heart Rate Measurements [O] . Mohammed Usman, Zeeshan Ahmad, Mohd Wajid 2019

机译：原始和预处理语音信号的数据集，MEL频率谱系比的语音和心率测量

Speech Quality Assessment using Mel Frequency Spectrograms of Speech Signals

摘要

著录项

相似文献

相关主题

期刊订阅