首页> 外文OA文献 >Enhanced forensic speaker verification using a combination of DWT and MFCC feature warping in the presence of noise and reverberation conditions
【2h】

Enhanced forensic speaker verification using a combination of DWT and MFCC feature warping in the presence of noise and reverberation conditions

机译:DWT和MFCC在声音和混响条件下结合使用时,结合使用了DWT和MFCC功能变形功能,增强了鉴证演说者的验证能力

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Environmental noise and reverberation conditions severely degrade the performance of forensic speaker verification. Robust feature extraction plays an important role in improving forensic speaker verification performance. This paper investigates the effectiveness of combining features, Mel frequency cepstral coefficients (MFCC) and MFCC extracted from the discrete wavelet transform (DWT) of the speech, with and without feature warping for improving modern identityvector (i-vector) based speaker verification performance in the presence of noise and reverberation. The performance of ivector speaker verification was evaluated using different feature extraction techniques: MFCC, feature-warped MFCC, DWTMFCC, feature-warped DWT-MFCC, a fusion of DWT-MFCC and MFCC features and fusion feature-warped DWT-MFCC and feature-warped MFCC features.We evaluated the performance of i-vector speaker verification using the Australian Forensic Voice Comparison (AFVC) and QUT-NOISE databases in the presence of noise, reverberation, and noisy and reverberation conditions. Our results indicate that the fusion of feature-warped DWTMFCC and feature-warped MFCC is superior to other feature extraction techniques in the presence of environmental noise under the majority of signal to noise ratios (SNRs), reverberation, and noisy and reverberation conditions. At 0 dB SNR, the performance of the fusion of feature-warped DWT-MFCC and feature-warped MFCC approach achieves a reduction in average equal error rate (EER) of 21.33%, 20.00%, and 13.28% over feature-warped MFCC, respectively, in the presence of various types of environmental noises only, reverberation, and noisy and reverberation environments. The approach can be used for improving the performance of forensic speaker verification and it may be utilized for preparing legal evidence in court.
机译:环境噪声和混响条件严重降低了法医说话人验证的性能。强大的特征提取在提高法医说话者验证性能方面起着重要作用。本文研究了结合特征,从语音离散小波变换(DWT)提取的Mel频率倒谱系数(MFCC)和MFCC组合的有效性,在有无特征扭曲的情况下,改进了基于现代身份向量(i-vector)的说话者验证性能。噪音和混响的存在。使用不同的特征提取技术评估了ivector说话者验证的性能:MFCC,特征变形的MFCC,DWTMFCC,特征变形的DWT-MFCC,DWT-MFCC和MFCC特征的融合以及融合特征变形的DWT-MFCC和特征-扭曲的MFCC功能。我们在存在噪声,混响以及嘈杂和混响条件的情况下,使用澳大利亚法医语音比较(AFVC)和QUT-NOISE数据库评估了i-vector扬声器验证的性能。我们的结果表明,在大多数信噪比(SNR),混响以及噪声和混响条件下,存在环境噪声的情况下,特征扭曲DWTMFCC和特征扭曲MFCC的融合优于其他特征提取技术。在SNR为0 dB的情况下,扭曲特征DWT-MFCC和扭曲特征MFCC方法的融合性能比扭曲特征MFCC降低了21.33%,20.00%和13.28%的平均均等错误率(EER),仅在存在各种类型的环境噪声,混响以及嘈杂和混响环境的情况下。该方法可用于改善法医演说者验证的性能,并可用于在法庭上准备法律证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号