Maximum entropy PLDA for robust speaker recognition under speech coding distortion

Ahmed Krobba; Mohamed Debyeche; Sid. Ahmed Selouani

首页> 外文期刊>International journal of speech technology >Maximum entropy PLDA for robust speaker recognition under speech coding distortion

【24h】

Maximum entropy PLDA for robust speaker recognition under speech coding distortion

机译：最大熵PLDA用于语音编码失真下的健壮说话人识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The system combining i-vector and probabilistic linear discriminant analysis (PLDA) has been applied with great success in the speaker recognition task. The i-vector space gives a low-dimensional representation of a speech segment and training data of a PLDA model, which offers greater robustness under different conditions. In this paper, we propose a new framework based on i-vector/PLDA and Maximum Entropy (ME) to improve the performance of speaker identification system in the presence of speech coding distortion. The results are reported on TIMIT database and speech coding obtained by passing the speech test from TIMIT database through the AMR encoder/decoder. Our results show that the proposed methode achieves improved performance when compared with the i-vector/PLDA and MEGMM.

机译：结合了i-vector和概率线性判别分析（PLDA）的系统已在说话人识别任务中取得了巨大成功。 i向量空间给出了语音段的低维表示和PLDA模型的训练数据，这在不同条件下具有更高的鲁棒性。在本文中，我们提出了一个基于i-vector / PLDA和最大熵（ME）的新框架，以提高存在语音编码失真的说话人识别系统的性能。结果报告在TIMIT数据库上，通过TIMIT数据库的语音测试通过AMR编码器/解码器获得语音编码。我们的结果表明，与i-vector / PLDA和MEGMM相比，该方法具有更高的性能。

著录项

来源
《International journal of speech technology》 |2019年第4期|1115-1122|共8页
作者
Ahmed Krobba; Mohamed Debyeche; Sid. Ahmed Selouani;
展开▼
作者单位

Speech Communication and Signal Processing Laboratory Universite des Sciences et de la Technologie Houari Boumediene (USTHB) Algiers Algeria;

LARIHS Laboratory Campus Shappaing University of Moncton Moncton Canada;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
GMM-UBM; MEGMM; i-vector/PLDA; i-vector/MEPLDA; Speaker identification; Speech coding;

机译：GMM-UBM;MEGMM;i-vector / PLDA;i-vector / MEPLDA;说话人识别;语音编码;

相似文献

外文文献
中文文献
专利

1. Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Pattern recognition letters . 2017,第octa15期

机译：基于深度神经网络的语音识别和说话人自适应的插件最大后验解码器的分层贝叶斯组合
2. Distortion-class modeling for robust speech recognition under GSM RPE-LTP coding [J] . Juan M.Huerta, Richard M.Stern Speech Communication . 2001,第1a2期

机译：在GSM RPE-LTP编码下实现鲁棒语音识别的失真类建模
3. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE transactions on information and systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
4. Robust Speaker Recognition in the Presence of Speech Coding Distortion for Remote Access Applications [C] . Robert W. Mudrowsky, Ravi P. Ramachandran, Umashanger Thayasivam, International Conference on Data Mining . 2016

机译：在存在语音编码失真的情况下，强大的扬声器识别用于远程访问应用程序
5. Robust speaker recognition in the presence of speech coding distortion [D] . Mudrowsky, Robert W. 2016

机译：在语音编码失真存在下强大的扬声器识别
6. Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：识别消息和使者：仿生频谱分析可增强语音和说话者识别能力
7. An articulatorily constrained, maximum entropy approach to speech recognition and speech coding [O] . Hogden, J. 1996

机译：语音识别和语音编码的咬合约束，最大熵方法
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Maximum entropy PLDA for robust speaker recognition under speech coding distortion

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅