首页> 外文期刊>Journal of electrical and computer engineering >Speaker Recognition Using Wavelet Cepstral Coefficient, I-Vector, and Cosine Distance Scoring and Its Application for Forensics
【24h】

Speaker Recognition Using Wavelet Cepstral Coefficient, I-Vector, and Cosine Distance Scoring and Its Application for Forensics

机译:小波倒谱系数,I向量和余弦距离计分的说话人识别及其在法医学中的应用

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

An important application of speaker recognition is forensics. However, the accuracy of speaker recognition in forensic cases often drops off rapidly because of the ill effect of ambient noise, variable channel, different duration of speech data, and so on. Therefore, finding a robust speaker recognition model is very important for forensics. This paper builds a new speaker recognition model based on wavelet cepstral coefficient (WCC), i-vector, and cosine distance scoring (CDS). This model firstly uses the WCC to transform the speech into spectral feature vecors and then uses those spectral feature vectors to train the i-vectors that represent the speeches having different durations. CDS is used to compare the i-vectors to give out the evidence. Moreover, linear discriminant analysis (LDA) and the within-class covariance normalization (WCNN) are added to the CDS algorithm to deal with the channel variability problem. Finally, the likelihood ratio estimates the strength of the evidence. We use the TIMIT database to evaluate the performance of the proposed model. The experimental results show that the proposed model can effectively solve the troubles of forensic scenario, but the time cost of the method is high.
机译:说话人识别的重要应用是取证。但是,由于环境噪声,可变通道,语音数据的不同持续时间等不良影响,在法医案例中说话人识别的准确性通常会迅速下降。因此,找到可靠的说话人识别模型对于法医学非常重要。本文基于小波倒谱系数(WCC),i矢量和余弦距离评分(CDS)建立了新的说话人识别模型。该模型首先使用WCC将语音转换为频谱特征vecor,然后使用这些频谱特征向量训练代表具有不同持续时间的语音的i-vector。 CDS用于比较i向量以给出证据。此外,将线性判别分析(LDA)和类内协方差归一化(WCNN)添加到CDS算法中,以处理信道可变性问题。最后,似然比估计证据的强度。我们使用TIMIT数据库来评估所提出模型的性能。实验结果表明,该模型可以有效解决法医场景的烦恼,但该方法的时间成本较高。

著录项

  • 来源
    《Journal of electrical and computer engineering》 |2016年第2期|4908412.1-4908412.11|共11页
  • 作者

    Lei Lei; She Kun;

  • 作者单位

    Laboratory of Cyberspace, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China;

    Laboratory of Cyberspace, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号