首页> 外文期刊>Procedia Computer Science >A Comparative Study of IBM and IRM Target Mask for Supervised Malay Speech Separation from Noisy Background
【24h】

A Comparative Study of IBM and IRM Target Mask for Supervised Malay Speech Separation from Noisy Background

机译:来自嘈杂背景监督马来语言论分离的IBM和IRM目标面膜的比较研究

获取原文
           

摘要

This paper presents a comparative study of Ideal Binary Mask (IBM) and Ideal Ratio Mask (IRM) as training target for supervised Malay speech separation. Inspired by revolution of powerful computer system, Deep Neural Network (DNN) is used as a supervised algorithm to predict target mask from noisy mixture signal that is degraded by noise background. Although previous works showed IRM is better than IBM target mask with DNN algorithm, but it is incomparable due to different database. To validate DNN model with these target masks, 600 Malay utterances from a male and a female speaker were used in training session while remaining 120 Malay utterances were used in prediction session. The combination of acoustic features such as amplitude modulation spectrogram (AMS), mel-frequency cepstral coefficient (MFCC), relative spectral transformed perceptual linear prediction coefficients (RASTA-PLP) and Gammatone filter bank power spectra (GF) were used as input features to estimate target mask. The performance of intelligibility enhancement was evaluated using Short Time Objective Intelligibility (STOI) score. Average STOI score of IRM target mask indicated up to 0.83 for seen speakers while 0.76 for unseen speakers at -5dB babble noise, which is superior than IBM target mask.
机译:本文介绍了理想二元掩模(IBM)和理想比率面膜(IRM)作为监督马来语言论分离的培训目标的比较研究。灵感来自强大的计算机系统的革命,深神经网络(DNN)用作监督算法,以预测来自噪声背景下降的嘈杂混合信号的目标掩模。虽然以前的作品显示了IRM比IBM目标掩模更好,但由于数据库不同,它是无与伦比的。为了用这些目标掩模验证DNN模型,来自男性和女性扬声器的600种马来语话语在训练期间使用,同时剩下120种马来语话语在预测会议中使用。诸如幅度调制谱图(AMS)的声学特征的组合,MEL频率谱系码(MFCC),相对光谱变换的感知线性预测系数(RASTA-PLP)和γ-PLP)和γ-PLY滤波器组功率谱(GF)被用作输入特征估计目标面具。使用短时间客观可懂度(STOI)得分评估可明智性增强性能。 IRM目标面膜的平均STOI评分显示为可见扬声器的0.83,而在-5dB禁止噪声的看不见的扬声器0.76,比IBM目标面具优于IBM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号