首页> 外文会议>European Signal Processing Conference >Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech
【24h】

Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech

机译:通过从语音的正弦表示得出的压缩特征动力学来识别与文本相关的说话人

获取原文
获取外文期刊封面目录资料

摘要

Prevalent speaker recognition methods use only spectral-envelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporal-spectral dynamics of the entire speech signal. We propose a new feature for speaker recognition based on sinusoidal representation of speech called compressed spectral dynamics (Sinogram-CSD), which effectively captures such spectral dynamics and the inherent speaker identity. The discriminative power of CSD allows classification to remain simple. The proposed CSD-MSRI method uses a simple nearest neighbor classifier to deliver performance competitive to conventional MFCC+DTW based text-dependent speaker recognition methods at significantly lower complexity.
机译:流行的说话人识别方法仅使用基于频谱包络的​​功能(例如MFCC),而忽略了整个语音信号的时域频谱动态中包含的丰富的说话人身份信息。我们提出了一种基于语音正弦表示的说话人识别新功能,称为压缩频谱动力学(Sinogram-CSD),可有效捕获此类频谱动力学和固有的说话人身份。 CSD的辨别力使分类保持简单。提出的CSD-MSRI方法使用一个简单的最近邻分类器,以较低的复杂度提供与传统的基于MFCC + DTW的基于文本的说话人识别方法相比具有竞争力的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号