首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Speaker recognition via sparse representations using orthogonal matching pursuit
【24h】

Speaker recognition via sparse representations using orthogonal matching pursuit

机译:使用正交匹配追踪通过稀疏表示进行说话人识别

获取原文
获取原文并翻译 | 示例

摘要

The objective of this paper is to demonstrate the effectiveness of sparse representation techniques for speaker recognition. In this approach, each feature vector from unknown utterance is expressed as linear weighted sum of a dictionary of feature vectors belonging to many speakers. The weights associated with feature vectors in the dictionary are evaluated using orthogonal matching pursuit algorithm, which is a greedy approximation to l0 optimization. The weights thus obtained exhibit high level of sparsity, and only a few of them will have nonzero values. The feature vectors which belong to the correct speaker carry significant weights. The proposed method gives an equal error rate (EER) of 10.84% on NIST-2003 database, whereas the existing GMM-UBM system gives an EER of 9.67%. By combining evidence from both the systems an EER of 8.15% is achieved, indicating that both the systems carry complimentary information.
机译:本文的目的是证明稀疏表示技术对说话人识别的有效性。在这种方法中,来自未知话语的每个特征向量都表示为属于许多说话者的特征向量字典的线性加权和。使用正交匹配追踪算法评估字典中与特征向量相关的权重,该算法是对10优化的贪婪近似。这样获得的权重表现出高度的稀疏性,并且只有少数具有非零值。属于正确说话者的特征向量具有很大的权重。所提出的方法在NIST-2003数据库上给出的平均错误率(EER)为10.84%,而现有的GMM-UBM系统给出的EER为9.67%。通过将两个系统的证据相结合,可以达到8.15%的EER,这表明两个系统都携带有补充信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号