首页> 外文期刊>International Journal of Advanced Information Technology >Speaker Identification Using a Nonlinear Speech Model and ANN
【24h】

Speaker Identification Using a Nonlinear Speech Model and ANN

机译:使用非线性语音模型和ANN的说话人识别

获取原文
           

摘要

This paper introduces a nonlinear speech model for improved speaker identification. We modelled the speaker identity using Reconstructed Phase Space (RPS) of the speech signal and the Phase Space Point Distribution (PSPD) parameters. The PSPD parameters are extracted from five vowels uttered by different speakers. The speaker identification experiments are conducted based on the PSPD parameters using Feed Forward Multi Layer Perceptron (FFMLP). Overall performance of the Mel-frequency cepstral baseline system is compared with the proposed composite classifier system using both cepstral and PSPD features across ten different speakers. The experimental results indicate that the accuracy of the phase space approach by itself is below that of MFCC features and further it shows that the proposed approach in which PSPD features when used with MFCC, pitch and first formant frequency offers reasonable improvement in speaker identification accuracy of the system
机译:本文介绍了一种非线性语音模型,用于改进说话人识别。我们使用语音信号的重构相空间(RPS)和相空间点分布(PSPD)参数对说话人身份进行建模。从不同说话者发出的五个元音中提取PSPD参数。使用前馈多层感知器(FFMLP)基于PSPD参数进行说话人识别实验。将梅尔频率倒谱基线系统的整体性能与在十个不同扬声器中同时使用倒谱和PSPD功能的拟议复合分类器系统进行了比较。实验结果表明,相空间方法本身的准确性低于MFCC特征的准确性,并且还表明,所提出的方法(其中PSPD特征与MFCC,基音和第一共振峰频率一起使用)可以合理地提高扬声器的说话人识别准确性。系统

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号