A study on speaker normalized MLP features in LVCSR

Abstract

Different normalization methods are applied in recent Large Vocabulary Continuous Speech Recognition (LVCSR) systems to reduce the influence of speaker variability on the acoustic models. In this paper we investigate the use of Vocal Tract Length Normalization (VTLN) and Speaker Adaptive Training (SAT) in Multi Layer Perceptron (MLP) feature extraction on an English task. We achieve significant improvements with each normalization method and gain further by stacking the normalizations. Further experiments with features transformed by Constrained Maximum Likelihood Linear Regression (CMLLR) based SAT as a possible MLP input show that the MLP could not consistently take advantage of SAT as it does in the case of VTLN.

Bibliographic details

Source: Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), 2011, pp. 1096-1099 (4 pages)
Authors: Zoltan Tueske; Christian Plahl; Ralf Schlueter
Original format: PDF
Chinese Library Classification: Communications
Keywords: GMM-HMM; VTLN; SAT; CMLLR; LVCSR; MLP; dempster-shafer

Similar literature

1. Emotional speech feature normalization and recognition based on speaker-sensitive feature clustering [J]. Chengwei Huang, Baolin Song, Li Zhao. International Journal of Speech Technology, 2016, No. 4.
2. Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model [J]. Davood Gharavian, Mansour Sheikhan, Farhad Ashoftedel. Neural Computing and Applications, 2013, No. 6.
3. Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model [J]. Davood Gharavian, Mansour Sheikhan, Farhad Ashoftedel. Neural computing & applications, 2013, No. 6.
4. Speaker adaptive bottleneck features extraction for LVCSR based on discriminative learning of speaker codes [C]. Kong Changqing, Xue Shaofei, Gao Jianqing. International Symposium on Chinese Spoken Language Processing, 2014.
5. Acoustic-feature-based frequency warping for speaker normalization [D]. Gouvea, Evandro Bacci. 1999.
6. MLPAnalyzer: Data Analysis Tool for Reliable Automated Normalization of MLPA Fragment Data [O]. Jordy Coffa, Mark A. van de Wiel, Begoña Diosdado. 2008.
7. Comparison and combination of different CRBE based MLP features for LVCSR [O]. Tüske Zoltán, Schlüter Ralf, Ney Hermann. 2012.
