MLLR Transforms Based Speaker Recognition in Broadcast Streams


Abstract

This paper deals with the use of maximum likelihood linear regression (MLLR) adaptation transforms for speaker recognition in broadcast news streams. The task is characterized in particular by widely varying acoustic conditions, microphones, transmission channels, and background noise, and by the short duration of recordings (usually in the range from 5 to 15 seconds). Features based on MLLR transforms are modeled using support vector machines (SVM). The results obtained are compared with a GMM-based system using traditional MFCC features. The paper also deals with inter-session variability compensation techniques suitable for both systems and emphasizes the importance of feature vector scaling for the SVM-based system.
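The abstract does not say how the MLLR transforms are turned into SVM inputs or which scaling is applied, so the following is only a minimal sketch under assumptions: each transform is taken to be a matrix [A | b] that is flattened row by row, and min-max scaling estimated on training data stands in for whatever scaling the authors used. All function names and the toy values are hypothetical.

```python
from typing import List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]


def transform_to_vector(transform: Matrix) -> List[float]:
    """Flatten one MLLR transform [A | b] row by row into a feature vector."""
    return [value for row in transform for value in row]


def fit_min_max(vectors: Sequence[Sequence[float]]) -> Tuple[List[float], List[float]]:
    """Estimate the per-dimension minimum and range on the training vectors."""
    dims = list(zip(*vectors))
    mins = [min(d) for d in dims]
    ranges = [max(d) - min(d) for d in dims]
    return mins, ranges


def scale(vector: Sequence[float], mins: Sequence[float], ranges: Sequence[float]) -> List[float]:
    """Map each dimension to [0, 1] relative to the training data.

    Dimensions that are constant in training (range 0) are mapped to 0.
    """
    return [(v - lo) / r if r > 0 else 0.0 for v, lo, r in zip(vector, mins, ranges)]


# Toy 2x3 transforms (assumed values): the last column plays the role of the
# offset b and has a much larger dynamic range than the entries of A.
train_transforms = [
    [[1.0, 0.0, 10.0], [0.0, 1.0, -5.0]],
    [[1.2, 0.1, 30.0], [0.2, 0.8, 5.0]],
]

train_vectors = [transform_to_vector(t) for t in train_transforms]
mins, ranges = fit_min_max(train_vectors)
scaled = [scale(v, mins, ranges) for v in train_vectors]
```

Without such scaling, dimensions with a large numeric range (here the offset column) would dominate the kernel evaluation, which is one common reason scaling matters for SVM inputs. The scaled vectors could then be passed to any SVM trainer.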


Bibliographic record

Source
COST Action 2102 International Conference on Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions | 2009 | 9 pages
Authors
Jan Silovsky; Petr Cerva; Jindrich Zdansky
Original format: PDF
Chinese Library Classification: TP18-53
Keywords
Speaker recognition; Broadcast news; MLLR; NAR


Similar literature

1. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J]. Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE Transactions on Information and Systems. 2012, No. 10
2. A study on MLLR-based speaker models using for speaker verification [J]. Masaharu Katoh, Junya Kanou, Akinori Ito, 電子情報通信学会技術研究報告. 音声. Speech. 2000, No. 137
3. MLLR Transforms Based Speaker Recognition in Broadcast Streams [C]. Jan Silovsky, Petr Cerva, Jindrich Zdansky, COST Action 2102 International Conference on Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions. 2009
4. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D]. Millington, Daniel S. 2011
5. A Tree Based Broadcast Scheme for (m,k)-firm Real-Time Stream in Wireless Sensor Networks [O]. HoSung Park, Beom-Su Kim, Kyong Hoon Kim, 2017
6. Speaker recognition with session variability normalization based on MLLR adaptation transforms [O]. Andreas Stolcke, Sachin S. Kajarekar, 1987
7. Speaker Recognition with Region-Constrained MLLR Transforms [R]. Stolcke, A., Mandal, A., Shriberg, E. 2013
