首页> 外文会议>EUSIPCO 2008;European signal processing conference >TEXT TEXT-DEPENDENT SPEAKER RECOGNITION BY COMPRESSED FEATURE FEATURE-DYNAMIC DYNAMICS DER S DERIV IVED FROM ED SINUSOIDAL REPRESENT REPRESENTATION OF SPEECH
【24h】

TEXT TEXT-DEPENDENT SPEAKER RECOGNITION BY COMPRESSED FEATURE FEATURE-DYNAMIC DYNAMICS DER S DERIV IVED FROM ED SINUSOIDAL REPRESENT REPRESENTATION OF SPEECH

机译:文本特征相关的说话人识别,通过压缩特征特征动力学从ED的正弦表示法得到

获取原文

摘要

Prevalent speaker recognition me methods use only spectral thods spectral-envelope based features such as MFCC, ignoring the richspeaker identity information contained in the temporal temporal-spectral dynamics of the entire speech signal. We propose anew feature for speaker recognition based on sinusoidalrepre representation of spe sentation speech ech called compressed spectral d dy- ynamics( namics Sino Sinogram ram-CSD) CSD), which effectively captures suchspectral dynamics and the inherent speaker identity. Thediscriminative pow power of CSD allows er classifica classification tion to r re- emamain simple. The proposed CSD in CSD-MSRI method uses a si sim- mplenearest neigh ple neighbor classifier to deliver bor performance co com- mpetitivepetitive to conventional MFCC+DTW based text text-dependentspeaker recogni recognition methods tion at signif significantly lower icantly co com- mplexity.Plexity.
机译:普遍的说话人识别方法仅使用基于频谱包络的​​频谱包络特征(例如MFCC),而忽略了整个语音信号的时态时域动态中包含的富说话人身份信息。我们基于语音表达的正弦表示形式,提出了一种新的说话人识别功能,称为压缩频谱动态学(南诺·汉诺·拉姆·CSD),它有效地捕获了这种频谱动态和固有的说话人身份。 CSD的区分电源可以使分类更简单。在CSD-MSRI方法中提出的CSD使用最简单的近邻分类器来提供与基于MFCC + DTW的传统基于文本的依赖于文本的说话者识别方法相竞争的bor性能,显着降低了复杂度。严谨。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号