高保真录音设备和回放设备的普及化及便携化,给说话人识别系统的抗回放语音攻击带来了严峻挑战.通过语谱图分析原始语音和回放语音在高频区的差异,有针对性地将语音信号在求取Mel(梅尔)倒谱系数过程中的Mel滤波器组逆置,并将DCT前的Mel对数频谱系数作为算法的特征.最后,利用支持向量机作为分类器对待测语音进行判别.实验结果表明,此算法能够有效地检测回放语音.另外,将此算法加载到GMM-UBM说话人识别系统后,显著地提升了系统的抗回放语音攻击能力.%The popularity and portability of high-fidelity audio recording equipment and playback equipment poses a serious challenge for speaker recognition systems against playback attacks.Based on the differences between the original speech and the playback speech in high frequency region,the algorithm reversed the Mel-filter bank in Mel-frequency cepstral coefficient (MFCC) calculation,and the coefficients before the DCT were used as the features of the algorithm.SVM was utilized as the classifier.Experimental results show that this algorithm can effectively detect the playback speech.In addition,the algorithm is integrated into the GMM-UBM speaker recognition system,which significantly improves the systems' capability of resisting the playback attack.
展开▼