首页> 外文期刊>International journal of speech technology >Maximum entropy PLDA for robust speaker recognition under speech coding distortion
【24h】

Maximum entropy PLDA for robust speaker recognition under speech coding distortion

机译:最大熵PLDA用于语音编码失真下的健壮说话人识别

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The system combining i-vector and probabilistic linear discriminant analysis (PLDA) has been applied with great success in the speaker recognition task. The i-vector space gives a low-dimensional representation of a speech segment and training data of a PLDA model, which offers greater robustness under different conditions. In this paper, we propose a new framework based on i-vector/PLDA and Maximum Entropy (ME) to improve the performance of speaker identification system in the presence of speech coding distortion. The results are reported on TIMIT database and speech coding obtained by passing the speech test from TIMIT database through the AMR encoder/decoder. Our results show that the proposed methode achieves improved performance when compared with the i-vector/PLDA and MEGMM.
机译:结合了i-vector和概率线性判别分析(PLDA)的系统已在说话人识别任务中取得了巨大成功。 i向量空间给出了语音段的低维表示和PLDA模型的训练数据,这在不同条件下具有更高的鲁棒性。在本文中,我们提出了一个基于i-vector / PLDA和最大熵(ME)的新框架,以提高存在语音编码失真的说话人识别系统的性能。结果报告在TIMIT数据库上,通过TIMIT数据库的语音测试通过AMR编码器/解码器获得语音编码。我们的结果表明,与i-vector / PLDA和MEGMM相比,该方法具有更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号