首页> 外文会议>Odyssey 2010: the speaker and language recognition workshop >Modeling Prosody for Speaker Recognition:Why Estimating Pitch May Be a Red Herring
【24h】

Modeling Prosody for Speaker Recognition:Why Estimating Pitch May Be a Red Herring

机译:说话人识别的韵律造型:为什么估计音高可能是红色鲱鱼

获取原文
获取原文并翻译 | 示例

摘要

It has long been claimed that spectral envelope features outperform prosodic features on speaker recognition tasks. However, the reasons for such an arrangement are not entirely compelling. In the current work we present some evidence to challenge these claims. We propose that energy found at harmonically related frequencies encodes the acoustic correlates of variables which are typically referred to as prosodic, making harmonic energy summation highly relevant. Its frequent implementation for estimating pitch appears to have gone unnoticed by the speaker recognition community, because pitch estimators quite deliberately discard what they compute, retaining only the abscissa of a maximum. We argue that this latter step renders pitch estimation somewhat ill-suited to speaker recognition tasks. We present the detailed construction of a discrete transform, and a normalization, which are amenable to relatively laconic modeling. With this framework we achieve or exceed the performance of spectral envelope features in nearfield, matched-channel and matched-multisession conditions; performance improves following envelope destruction. We believe these results may have far-reaching consequences. For speech processing in a multitude of applications, they suggest that modeling the harmonic structure in the way we propose is at least as relevant as is modeling other aspects of the signal.
机译:长期以来,人们一直认为频谱包络特征在说话人识别任务上胜过韵律特征。但是,这种安排的原因并不完全令人信服。在当前的工作中,我们提出一些证据来挑战这些主张。我们提出,在谐波相关频率处发现的能量编码通常称为韵律的变量的声学相关性,从而使谐波能量求和具有很高的相关性。说话者识别社区似乎没有注意到它经常用于估计音调,因为音调估计器故意放弃了它们的计算量,只保留了最大值的横坐标。我们认为,后面的步骤使音调估计有些不适合说话者识别任务。我们提出了离散变换和归一化的详细构造,它们适合于相对简单的建模。通过这种框架,我们可以达到或超过近场,匹配信道和匹配多会话条件下频谱包络特征的性能;破坏信封后性能会提高。我们认为这些结果可能会产生深远的影响。对于大量应用中的语音处理,他们建议以我们建议的方式对谐波结构进行建模至少与对信号的其他方面进行建模一样重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号