Robust Analysis and Weighting on MFCC Components for Speech Recognition and Speaker Identification

Abstract

Mismatch between training and testing data is a major source of error for both Automatic Speech Recognition (ASR) and Automatic Speaker Identification (ASI). In this paper, we first present a statistical weighting concept that exploits the unequal sensitivity of Mel-Frequency Cepstral Coefficient (MFCC) components to mismatch arising from ambient noise, recording equipment, transmission channels, and inter-speaker variation. Based on this concept, we then design a new Kullback-Leibler (KL) distance based weighting algorithm for real-world problems in which label information is often unavailable. We evaluate the algorithm on ASR with mismatch caused by different speakers and on ASI with mismatch caused by channel noise. Experimental results demonstrate the effectiveness and robustness of the proposed method.
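
The abstract does not state the weighting formula, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes each MFCC dimension can be modeled as a univariate Gaussian, measures the symmetric KL divergence between training and test statistics per dimension without using labels, and maps it to a weight with the hypothetical rule 1 / (1 + KL). The names `gaussian_kl` and `kl_weights`, the Gaussian assumption, and the mapping are all illustrative choices.

```python
import math
from statistics import fmean, pvariance


def gaussian_kl(mu_p, var_p, mu_q, var_q):
    """KL(p || q) in nats for two univariate Gaussians (closed form)."""
    return 0.5 * (math.log(var_q / var_p)
                  + (var_p + (mu_p - mu_q) ** 2) / var_q
                  - 1.0)


def kl_weights(train_frames, test_frames, eps=1e-8):
    """Return one weight per feature dimension; the weights sum to 1.

    Each argument is a list of frames, each frame a list of D coefficients.
    Dimensions whose train/test distributions diverge more get less weight.
    The mapping 1 / (1 + symmetric KL) is an assumption made for this
    sketch, not a formula taken from the paper.
    """
    dims = len(train_frames[0])
    raw = []
    for d in range(dims):
        a = [frame[d] for frame in train_frames]
        b = [frame[d] for frame in test_frames]
        # eps keeps the variance strictly positive for constant dimensions.
        mu_a, var_a = fmean(a), pvariance(a) + eps
        mu_b, var_b = fmean(b), pvariance(b) + eps
        skl = (gaussian_kl(mu_a, var_a, mu_b, var_b)
               + gaussian_kl(mu_b, var_b, mu_a, var_a))
        raw.append(1.0 / (1.0 + skl))
    total = sum(raw)
    return [w / total for w in raw]


# Toy data: dimension 0 matches across conditions, dimension 1 is shifted.
train = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
test = [[0.0, 5.0], [1.0, 6.0], [2.0, 7.0]]
weights = kl_weights(train, test)
```

In this toy case the shifted dimension receives the smaller weight. Such weights could then scale each coefficient's contribution to a distance or likelihood score, which is again an assumption about usage rather than something the abstract specifies.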

Bibliographic details

Source: 2007, pp. 188-191 (4 pages)
Authors: Xi Zhou; Yun Fu; Ming Liu; Mark Hasegawa-Johnson; Thomas S. Huang
Original format: PDF

Similar literature

1. Scale-invariant MFCCs for speech/speaker recognition [J]. Zekeriya Tüfekci, Gökay Dişken. Turkish Journal of Electrical Engineering and Computer Sciences. 2019, No. 5

2. Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification [J]. Po-Yi Shih, Po-Chuan Lin, Jhing-Fa Wang. Journal of Network and Computer Applications. 2011, No. 5

3. Enhancement of speech signal denoising based on MFCC and Robust Principal Component Analysis RPCA [J]. Sonia Moussa, Zied Hajaiej, Ali Garsallah. International Journal of Computer Science and Network Security. 2019, No. 3

4. Robust Analysis and Weighting on MFCC Components for Speech Recognition and Speaker Identification [C]. Zhou Xi, Fu Yun, Liu Ming. IEEE International Conference on Multimedia and Expo. 2007

5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition [D]. Zhang, Xianxian. 2005

6. Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition [O]. Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali

7. Robust Analysis and Weighting on MFCC Components for Speech Recognition and Speaker Identification [O]. Xi Zhou, Yun Fu, Ming Liu. 2008

8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment [R]. Hansen, J. H. 2015

