Speaker Recognition Against Utterance Variations

机译：扬声器识别对话语变化

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A speaker model in speaker recognition system is to be trained from a large data set gathered in multiple sessions. Large data set requires large amount of memory and computation, and moreover it's practically hard to make users utter the data in several sessions. Recently the incremental adaptation methods are proposed to cover the problems. However, the data set gathered from multiple sessions is vulnerable to the outliers from the irregular utterance variations and the presence of noise, which result in inaccurate speaker model. In this paper, we propose an incremental robust adaptation method to minimize the influence of outliers on Gaussian Mixture Model based speaker model. The robust adaptation is obtained from an incremental version of M-estimation. Speaker model is initially trained from small amount of data and it is adapted recursively with the data available in each session. Experimental results from the data set gathered over seven months show that the proposed method is robust against outliers.

机译：扬声器识别系统中的扬声器模型将从多个会话中收集的大数据集接受培训。大数据集需要大量的内存和计算，而且实际上很难让用户在几个会话中发出数据。最近提出了增量适应方法来涵盖问题。然而，从多个会话中收集的数据集容易受到来自不规则话语变化和存在噪声的异常值，这导致扬声器模型不准确。在本文中，我们提出了一种增量稳健的适应方法，以最大限度地减少基于高斯混合模型的扬声器模型的异常值的影响。从M估计的增量版本获得了鲁棒的适应。扬声器模型最初从少量数据培训，并且它递归适应每个会话中可用的数据。数据集的实验结果聚集在七个月内，表明该方法对异常值具有强大。

著录项

来源
《International confernce on computational science and its applications》|2003年||共7页
会议地点
作者
JongJoo Lee; JaeYeol Rheem; Ki Yong Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类理论、方法;
关键词

相似文献

外文文献
中文文献
专利

1. Histogram equalization using a reduced feature set of background speakers’ utterances for speaker recognition [J] . Myung-jae?Kim, Il-ho?Yang, Min-seok?Kim, Frontiers of Information Technology & Electronic Engineering . 2017,第5期

机译：使用减少的背景说话者特征集进行直方图均衡以识别说话者
2. Histogram equalization using a reduced feature set of background speakers' utterances for speaker recognition [J] . Myung-jae KIM, Il-ho YANG, Min-seok KIM, 浙江大学学报（英文版）（C辑：计算机与电子） . 2017,第005期

机译：使用减少的背景说话者说话特征集进行直方图均衡以识别说话者
3. End-to-end DNN based text-independent speaker recognition for long and short utterances [J] . Rohdin Johan, Silnova Anna, Diez Mireia, Computer speech and language . 2020,第Jana期

机译：基于端到端DNN的，与文本无关的说话人识别，可实现长话和短话
4. Robust Speaker Recognition Against Utterance Variations [C] . JongJoo Lee, JaeYeol Rheem, Ki Yong Lee International Conference on Computational Science and Its Applications - ICCSA 2003 Pt.2 May 18-21, 2003 Montreal, Canada . 2003

机译：强大的说话人识别能力，可防止话语变化
5. Effects of equipment variations on speaker recognition error rates. [D] . Shaver, Clark D. 2009

机译：设备变化对说话人识别错误率的影响。
6. Speaker-external versus speaker-internal forces on utterance form: Do cognitive demands override threats to referential success? [O] . Liane Wardlow Lane, Victor S. Ferreira -1

机译：说话者对说话者的外部力量与说话者内部的力量形式：认知需求是否超越了指称成功的威胁？
7. Improving short utterance based I-vector speaker recognition using source and utterance-duration normalization techniques [O] . Kanagasundaram Ahilan, Dean David, Gonzalez-Dominguez Javier, 2013

机译：使用源和话语持续时间归一化技术改进基于短话语的I矢量说话人识别
8. Speaker Recognition from an Unknown Utterance and Speaker-Speech Interaction. [R] . Kashyap, R. L. 1976

机译：来自未知话语和说话者 - 语音交互的说话人识别。

Speaker Recognition Against Utterance Variations

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅