VOCAL TRACT NORMALIZATION BASED ON SPECTRAL WARPING

机译：基于光谱包裹的声带归一化

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Two techniques for speaker adaptation based on frequency scale modifications are described and evaluated. In one method, minimum mean square error matching is performed between a spectral template for each speaker to a "typical speaker" spectral template. One parameter, a warping factor, is used to control the spectral matching. In the second method, a neural network classifier is used to adjust the frequency warping factor for each speaker so as to maximize vowel classification performance for each speaker. A vowel classifier trained only with normalized female speech and tested only with normalized male speech, or vice versa, is nearly as accurate as when speaker genders are matched for training and testing, and the speech is not normalized. The improvement due to normalization is much smaller, if training and test data are matched. The normalization based on classification performance is superior to that based on minimizing mean square error.

机译：描述和评估了基于频率标度修改的两种说话人自适应技术。在一种方法中，在每个说话者的频谱模板与“典型说话者”频谱模板之间执行最小均方误差匹配。一个参数，即翘曲因子，用于控制光谱匹配。在第二种方法中，使用神经网络分类器来调整每个说话者的频率扭曲因子，以便最大化每个说话者的元音分类性能。仅使用标准化的女性语音训练并且仅使用标准化的男性语音进行测试的元音分类器，其准确度几乎与匹配说话者性别进行训练和测试且语音未进行标准化的准确度差不多。如果训练和测试数据匹配，则归一化带来的改进要小得多。基于分类性能的归一化优于基于最小均方误差的归一化。

著录项

来源
《International Conference on Spoken Language Processing; 20041004-08; Jeju(KR)》|2004年|P.2921-2924|共4页
会议地点 Jeju(KR)
作者
Wei Wang; Stephen A. Zahorian;
展开▼
作者单位

Department of Electrical and Computer Engineering Old Dominion University Norfolk, VA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类应用语言学;
关键词
入库时间 2022-08-26 14:07:38

相似文献

外文文献
中文文献
专利

1. Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification [J] . Walid Hussein, Sarah Akram Essmat, Nestor Yoma, International Journal of Recent Contributions from Engineering, Science & IT . 2016,第4期

机译：用于说话人验证的基于人行道长度归一化分类器的评估
2. Effects of Speaker Normalization Based on Vocal Tract Length Ratios on Word Recognition Using Compound Parameters [J] . Naomitsu Ikeda, Tadashi Sakata, Tomoaki Hirayama, Systems and Computers in Japan . 2005,第12期

机译：基于语音长度比的说话人归一化对复合参数词识别的影响
3. Vocal tract length normalization using linear transformation based on maximum likelihood estimation [J] . Jun Rokui, Nakai Mitsuru, Hiroshi Shimodaira, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2001,第520期

机译：基于最大似然估计的线性变换对声道长度进行归一化
4. VOCAL TRACT NORMALIZATION BASED ON SPECTRAL WARPING [C] . Wei Wang, Stephen A. Zahorian, International Speech Communication Association International Conference on Spoken Language Processing . 2004

机译：基于光谱翘曲的声带标准化
5. Frequency warping by linear transformation, and vocal tract inversion for speaker normalization in automatic speech recognition. [D] . Panchapagesan, Sankaran. 2008

机译：通过线性变换实现的频率扭曲和声道反转，可在自动语音识别中实现说话人归一化。
6. A New Method to Explore the Spectral Impact of the Piriform Fossae on the Singing Voice: Benchmarking Using MRI-Based 3D-Printed Vocal Tracts [O] . Bertrand Delvaux, David Howard -1

机译：探索梨状窝对歌声的频谱影响的新方法：使用基于MRI的3D打印声带进行基准测试
7. Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification [O] . Walid Hussein, Sarah Akram Essmat, Nestor Yoma, 2016

机译：基于声道长度归一化的说话人验证分类器评估

VOCAL TRACT NORMALIZATION BASED ON SPECTRAL WARPING

摘要

著录项

相似文献

相关主题

期刊订阅