Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification

Osman Büyük

首页> 外文期刊>Signal Processing, IET >Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification

【24h】

Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification

机译：基于Sentence-HMM状态的i-vector / PLDA建模可提高与文本相关的单个说话者说话人验证的性能

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we make use of hidden Markov model (HMM) state alignment information in i-vector/probabilistic linear discriminant analysis (PLDA) framework to improve the verification performance in a text-dependent single utterance (TDSU) task. In the TDSU task, speakers repeat a fixed utterance in both enrollment and authentication sessions. Despite Gaussian mixture models (GMMs) have been the dominant modeling technique for text-independent applications, an HMM based method might be better suited for the TDSU task since it captures the co-articulation information better. Recently, powerful channel compensation techniques such as joint factor analysis (JFA), i-vectors and PLDA have been proposed for GMM based text-independent speaker verification. In this study, we train a separate i-vector/PLDA model for each sentence HMM state in order to utilize the alignment information of the HMM states in a TDSU task. The proposed method is tested using a multi-channel speaker verification database. In the experiments, it is observed that HMM state based i-vector/PLDA (i-vector/PLDA-HMM) provides approximately 67% relative reduction in equal error rate (EER) when compared to the i-vector/PLDA. The proposed method also outperforms the baseline GMM and sentence HMM methods. It yields approximately 51% relative reduction in EER over the best performing sentence HMM method.

机译：在本文中，我们在i-矢量/概率线性判别分析（PLDA）框架中使用了隐马尔可夫模型（HMM）状态对齐信息，以提高在文本相关的单言语（TDSU）任务中的验证性能。在TDSU任务中，演讲者会在注册和身份验证会话中重复固定的发音。尽管高斯混合模型（GMM）已成为独立于文本的应用程序的主要建模技术，但基于HMM的方法可能更好地适合TDSU任务，因为它可以更好地捕获共同发音信息。最近，针对基于GMM的独立于文本的说话者验证，已经提出了强大的信道补偿技术，例如联合因子分析（JFA），i矢量和PLDA。在这项研究中，我们为每个句子HMM状态训练一个单独的i-vector / PLDA模型，以便在TDSU任务中利用HMM状态的对齐信息。使用多通道扬声器验证数据库对提出的方法进行了测试。在实验中，观察到与i-vector / PLDA相比，基于HMM状态的i-vector / PLDA（i-vector / PLDA-HMM）提供了大约67％的均等错误率（EER）相对降低。所提出的方法也优于基线GMM和句子HMM方法。与性能最佳的句子HMM方法相比，它的EER相对降低了约51％。

著录项

来源
《Signal Processing, IET》 |2016年第8期|918-923|共6页
作者
Osman Büyük;
展开▼
作者单位

Kocaeli University, Turkey;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
vectors; authorisation; error analysis; Gaussian processes; hidden Markov models; mixture models; speaker recognition;

机译：向量;授权;误差分析;高斯过程;隐马尔可夫模型;混合模型;说话人识别;
入库时间 2022-08-17 13:32:17

相似文献

外文文献
中文文献
专利

1. Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification [J] . Laskar Mohammad Azharuddin, Bhanja Chuya China, Laskar Rabul Hussain Circuits, systems and signal processing . 2021,第10期

机译：PLDA模型的扬声器 - 短语特定调整，提高文本依赖扬声器验证中的性能
2. Speaker-Phonetic I-Vector Modeling for Text-Dependent Speaker Verification with Random Digit Strings [J] . Shengyu YAO, Ruohua ZHOU, Pengyuan ZHANG IEICE transactions on information and systems . 2019,第2期

机译：带有随机数字字符串的文本相关说话人验证的说话人语音I矢量建模
3. Model selection and score normalization for text-dependent single utterance speaker verification [J] . OSMAN BüYüK, MUSTAFA LEVENT ARSLAN Turkish Journal of Electrical Engineering and Computer Sciences . 2012,第Supa2期

机译：模型选择和分数归一化，用于与文本相关的单个说话者说话人验证
4. Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances [C] . Larcher Anthony, Lee Kong Aik, Ma Bin, IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：具有语音约束的PLDA建模，用于具有多个简短发音的与文本相关的说话人验证
5. Speaker adaptation in joint factor analysis based text independent speaker verification [D] . Shou-Chun, Yin 2007

机译：基于联合因素分析的文本自适应说话人验证中的说话人适应
6. Bidirectional Attention for Text-Dependent Speaker Verification [O] . Xin Fang, Tian Gao, Liang Zou, 2020

机译：文本依赖扬声器验证的双向关注
7. Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques [O] . Kanagasundaram A., Dean D., Sridharan S., 2014

机译：使用话语方差建模和补偿技术改善短话语i矢量说话者验证

Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification

摘要

著录项

相似文献

相关主题

期刊订阅