【24h】

PNCC for Forensic Automatic Speaker Recognition

机译:PNCC用于法医自动扬声器识别

获取原文

摘要

Forensic Automatic Speaker Recognition (FASR) is a process for making judgments on whether a particular speech utterance belongs to a suspected speaker. This is a challenging task as the sample tested may be received from different channels and noise conditions or may be disguised. Robust feature extraction plays an important role in improving the performance of FASR. In this paper, a forensic automatic speaker recognition system is implemented which exploits Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) features. Performance of the system is demonstrated on a Malayalam speaker database. The speaker recognition framework is based on conventional i-vector based system. Experimental results suggest that the PNCC features provide slightly inferior performance with the MFCC features while tested under the conditions namely, masked speech, telephone speech, and Voice over Internet Protocol (VoIP). But, it is also observed that PNCC provides better performance than MFCC in noisy conditions.
机译:法医自动扬声器识别(FASR)是关于特定语音话语是否属于疑似扬声器的判断的过程。这是一种具有挑战性的任务,因为可以从不同的信道和噪声条件接收测试或者可以伪造的样本。鲁棒特征提取在提高Fasr性能方面发挥着重要作用。本文实现了一种法医自动扬声器识别系统,其利用功率归一化谱系齐系数(PNCC)和MEL频率谱系数(MFCCS)特征。在Malayalam扬声器数据库上展示了系统的性能。扬声器识别框架基于传统的基于I形向量的系统。实验结果表明,PNCC功能在条件下测试时,MFCC功能提供了略微低劣的性能,即屏蔽的语音,电话语音和互联网协议语音(VoIP)。但是,还观察到,PNCC在嘈杂的条件下提供比MFCC更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号