首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound
【24h】

Speaker Identification Using Pseudo Pitch Synchronized Phase Information in Voiced Sound

机译:在语音中使用伪音高同步相位信息识别说话人

获取原文

摘要

In conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs), phase information is ignored. Our recent studies have shown that phase information contains speaker dependent characteristics. We propose a new extraction method to extract pitch synchronous phase information from the voiced section only. Speaker identifi- cation experiments were performed using the NTT clean database and JNAS database. Using the new phase extraction method, we obtained a relative reduction in the speaker error rate of approximately 27% and 46%, respectively, for the two databases. We also obtained a relative error reduction of approximately 52% and 42%, respectively, when combining phase information with the MFCC-based method.
机译:在基于梅尔频率倒谱系数(MFCC)的常规说话人识别方法中,相位信息被忽略。我们最近的研究表明,相位信息包含说话者相关的特征。我们提出了一种仅从浊音部分中提取音高同步相位信息的新提取方法。使用NTT clean数据库和JNAS数据库进行了说话人识别实验。使用新的相位提取方法,对于这两个数据库,我们分别将说话人错误率分别降低了约27%和46%。当将相位信息与基于MFCC的方法结合时,我们也分别获得了约52%和42%的相对误差减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号