JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition

机译：JFA建模，具有从左到右的结构以及一个新的后端，用于文本相关的说话人识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces a new formulation of Joint Factor Analysis (JFA) for text-dependent speaker recognition based on left-to-right modeling with tied mixture HMMs. It accommodates many different ways of extracting multiple features to characterize speakers (features may or may not be HMM state-dependent, they may be modeled with subspace or factorial priors and these priors maybe imputed from text-dependent or text-independent background data). We feed these features to a new, trainable classifier for text-dependent speaker recognition in a manner which is broadly analogous to the i-vector/PLDA cascade in text-independent speaker recognition. We have evaluated this approach on a challenging proprietary dataset consisting of telephone recordings of short English and Urdu pass-phrases collected in Pakistan. By fusing results obtained with multiple front ends, equal error rate of around 2% are achievable.

机译：本文介绍了一种新的联合因子分析（JFA）公式，用于基于从左到右建模并带有混合HMM的文本相关的说话人识别。它提供了多种提取多个特征以表征说话人的方式（特征可能取决于HMM状态，也可能不取决于HMM状态，可以使用子空间或阶乘先验进行建模，并且这些先验可以从与文本相关或与文本无关的背景数据中推算出来）。我们将这些功能提供给一个新的，可训练的分类器，以与文本无关的说话人识别大体上类似于i-vector / PLDA级联的方式来进行与文本有关的说话人识别。我们已经在具有挑战性的专有数据集上评估了这种方法，该数据集包括在巴基斯坦收集的英语和乌尔都语短短语的电话录音。通过融合从多个前端获得的结果，可以实现大约2％的相等错误率。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2015年|4689-4693|共5页
会议地点
作者
Kenny Patrick; Stafylakis Themos; Alam Jahangir; Kockmann Marcel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Joint Factor Analysis; text-dependent speaker recognition;

机译：联合因素分析;基于文本的说话人识别;

相似文献

外文文献
中文文献
专利

1. Speaker and Channel Factors in Text-Dependent Speaker Recognition [J] . Stafylakis Themos, Kenny Patrick, Alam Md. Jahangir, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第1期

机译：文本相关的说话人识别中的说话人和频道因素
2. Speaker-Phonetic I-Vector Modeling for Text-Dependent Speaker Verification with Random Digit Strings [J] . Shengyu YAO, Ruohua ZHOU, Pengyuan ZHANG IEICE transactions on information and systems . 2019,第2期

机译：带有随机数字字符串的文本相关说话人验证的说话人语音I矢量建模
3. Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification [J] . Laskar Mohammad Azharuddin, Bhanja Chuya China, Laskar Rabul Hussain Circuits, systems and signal processing . 2021,第10期

机译：PLDA模型的扬声器 - 短语特定调整，提高文本依赖扬声器验证中的性能
4. JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition [C] . P. Kenny, T. Stafylakis, J. Alam, IEEE International Conference on Acoustics, Speech and Signal Processing . 2015

机译：JFA建模与左右结构和文本依赖扬声器识别的新后端
5. Reducing computation in speaker recognition systems using a tree-structured universal background model. [D] . McClanahan, Richard Daniel. 2014

机译：使用树型通用背景模型来减少说话人识别系统中的计算。
6. Bidirectional Attention for Text-Dependent Speaker Verification [O] . Xin Fang, Tian Gao, Liang Zou, 2020

机译：文本依赖扬声器验证的双向关注
7. DIRECT MODELING OF SPOKEN PASSWORDS FOR TEXT-DEPENDENT SPEAKER RECOGNITION BY COMPRESSED TIME-FEATURE REPRESENTATIONS [O] . Amitava Das 2015

机译：用压缩时间特征表示直接识别语音相关语音识别的密码模型

JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition

摘要

著录项

相似文献

相关主题

期刊订阅