TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM AND FRAME LEVEL LIKELIHOOD NORMALIZATION

机译：使用GMM-UBM和框架水平相似度标准化的文本无关的说话人识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we describe a Gaussian Mixture Model-Universal Background Model (GMM-UBM) speaker identification system. In this GMM-UBM system, we derive the hypothesized speaker model by adapting the parameters of UBM using the speaker's training speech and a form of Bayesian adaptation. The UBM technique is incorporated into the GMM speaker identification system to reduce the time requirement for recognition significantly. The paper also presents a new frame level likelihood score normalization for adjusting different scores of speaker models to get more robust scores in final decision. Experiments on the 2000 NIST Speaker Recognition Evaluation corpus show that GMM-UBM and frame level likelihood score normalization yield better performance. Compared to the baseline system, around 31.2% relative error reduction is obtained from the combination of both techniques.

机译：在本文中，我们描述了高斯混合模型-通用背景模型（GMM-UBM）说话人识别系统。在此GMM-UBM系统中，我们通过使用说话人的训练语音和贝叶斯自适应形式来调整UBM的参数，从而得出假设的说话人模型。 GBM说话人识别系统采用了UBM技术，以大大减少识别所需的时间。本文还提出了一种新的帧级别似然评分归一化方法，用于调整说话人模型的不同评分，以在最终决策中获得更可靠的评分。在2000年NIST说话者识别评估语料库上的实验表明，GMM-UBM和帧级别似然评分归一化可以产生更好的性能。与基线系统相比，两种技术的组合可减少约31.2％的相对误差。

著录项

来源
《International Symposium on Chinese Spoken Language Processing; 20041215-18; Hong Kong(CN)》|2004年|P.289-292|共4页
会议地点 Hong Kong(CN)
作者
Rong Zheng; Shuwu Zhang; Bo Xu;
展开▼
作者单位

High Technology and Innovation Center, National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences, Beijing;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. Average framing linear prediction coding with wavelet transform for text-independent speaker identification system [J] . Khaled Daqrouq, Khalooq Y. Al Azzawi Computers and Electrical Engineering . 2012,第6期

机译：独立于文本的说话人识别系统的小波平均成帧线性预测编码
2. Cross similarity measurement for speaker adaptive test normalization in text-independent speaker verification [J] . ZHAO Jian, DONG Yuan, ZHAO Xian-yu, 中国邮电高校学报（英文版） . 2008,第002期

机译：跨相似度测量，用于独立于文本的说话人验证中的说话人自适应测试标准化
3. Robust regression fusion of GMM-UBM and GMM-SVM normalized scores using G729 bit-stream for speaker recognition over IP [J] . Dalila Yessad, Abderrahmane Amrouche International journal of speech technology . 2014,第1期

机译：使用G729比特流对GMM-UBM和GMM-SVM归一化分数进行稳健的回归融合，以通过IP进行说话人识别
4. Text-independent speaker identification using GMM-UBM and frame level likelihood normalization [C] . Rong Zheng, Shuwu Zhang, Bo Xu . 2004

机译：使用GMM-UBM和帧级别似然归一化的与文本无关的说话人识别
5. Implementation and Improvement of Common Text-Independent Speaker Identification [D] . Wang, Yunlong. 2020

机译：实施和改进常见的文本无关的扬声器识别
6. Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles [O] . Soo Jin Park, Gary Yeung, Neda Vesselinova, -1

机译：旨在理解人和机器中说话者的辨别能力以实现不同语音风格的与文本无关的简短发声
7. TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM AND FRAME LEVEL LIKELIHOOD NORMALIZATION [O] . Rong Zheng, Shuwu Zhang, Bo Xu 2014

机译：使用Gmm-UBm和帧级可能性标准化进行文本独立的扬声器识别

TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM AND FRAME LEVEL LIKELIHOOD NORMALIZATION

摘要

著录项

相似文献

相关主题

期刊订阅