【24h】

MLLR Transforms Based Speaker Recognition in Broadcast Streams

机译:MLLR在广播流中转换了基于扬声器识别

获取原文

摘要

This paper deals with utilization of maximum likelihood linear regression (MLLR) adaptation transforms for speaker recognition in broadcast news streams. This task is specific particularly for widely varying acoustic conditions, microphones, transmission channels, background noise and short duration of recordings (usually in the range from 5 to 15 seconds). MLLR transforms based features are modeled using support vector machines (SVM). Obtained results are compared with a GMM based system with traditional MFCC features. The paper also deals with inter-session variability compensation techniques suitable for both systems and emphases the importance of feature vector scaling for SVM based system.
机译:本文涉及利用最大似然线性回归(MLLR)适应转换的广播新闻流中的扬声器识别。该任务特别是针对广泛改变的声学条件,麦克风,传输通道,背景噪声和录制持续时间短(通常在5到15秒的范围内)。基于MLLR变换的功能是使用支持向量机(SVM)建模的。将得到的结果与具有传统MFCC功能的GMM基础系统进行比较。本文还涉及适用于两个系统的会话间可变性补偿技术,并重视基于SVM系统的特征矢量缩放的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号