...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch
【24h】

Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch

机译:幂律调整线性预测的特征提取及其在严重人声力度不匹配下的说话人识别中的应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Linear prediction is one of the most established techniques in signal estimation, and it is widely utilized in speech signal processing. It has been long understood that the nerve firing rate of human auditory system can be approximated by power law non-linearity, and this has been the motivation behind using perceptual linear prediction in extracting acoustic features in a variety of speech processing applications. In this paper, we revisit the application of power law non-linearity in speech spectrum estimation by compressing/expanding power spectrum in autocorrelation-based linear prediction. The development of so-called LP- is motivated by a desire to obtain spectral features that present less mismatch than conventionally used spectrum estimation methods when speech of normal loudness is compared to speech under vocal effort. The effectiveness of the proposed approach is demonstrated in a speaker recognition task conducted under severe vocal effort mismatch comparing shouted versus normal speech mode.
机译:线性预测是信号估计中最成熟的技术之一,在语音信号处理中得到了广泛的应用。长期以来人们一直了解,人类听觉系统的神经发声速率可以通过幂律非线性来近似,这一直是在各种语音处理应用中使用感知线性预测来提取声学特征的背后动机。在本文中,我们通过在基于自相关的线性预测中压缩/扩展功率谱,重新探讨了幂律非线性在语音谱估计中的应用。当将正常响度的语音与在人声下的语音进行比较时,寻求获得比常规使用的频谱估计方法呈现更少失配的频谱特征的愿望推动了所谓LP-的发展。比较大声与正常语音模式时,在严重的语音不匹配情况下进行的说话人识别任务证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号