Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM

Longbiao Wang; Norihide Kitaoka; Seiichi Nakagawa

首页> 外文期刊>Speech Communication >Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM

【24h】

Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM

机译：通过结合特定于说话人的GMM和适用于说话人的HMM，基于位置相关的CMN进行鲁棒的远方说话人识别

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

AI期刊论文写作 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we propose a robust speaker recognition method based on position-dependent Cepstral Mean Normalization (CMN) to compensate for the channel distortion depending on the speaker position. In the training stage, the system measures the transmission characteristics according to the speaker positions from some grid points to the microphone in the room and estimates the compensation parameters a priori. In the recognition stage, the system estimates the speaker position and adopts the estimated compensation parameters corresponding to the estimated position, and then the system applies the CMN to the speech and performs speaker recognition. In our past study, we proposed a new text-independent speaker recognition method by combining speaker-specific Gaussian mixture models (GMMs) with syllable-based HMMs adapted to the speakers by MAP [Nakagawa, S., Zhang, W., Takahashi, M., 2004. Text-independent speaker recognition by combining speaker-specific GMM with speaker-adapted syllable-based HMM. Proc. ICASSP-2004 1,81-84]. The robustness of this speaker recognition method for the change of the speaking style in close-talking environment was evaluated in (Nakagawa et al., 2004). In this paper, we extend this combination method to distant speaker recognition and integrate this method with the proposed position-dependent CMN. Our experiments showed that the proposed method improved the speaker recognition performance remarkably in a distant environment.

机译：在本文中，我们提出了一种基于位置依赖的倒谱均值归一化（CMN）的鲁棒说话人识别方法，以补偿取决于说话人位置的声道失真。在训练阶段，系统根据扬声器位置（从一些网格点到房间中的麦克风）测量传输特性，并事先估计补偿参数。在识别阶段，系统估计说话者的位置并采用与估计位置相对应的估计补偿参数，然后系统将CMN应用于语音并执行说话者识别。在过去的研究中，我们通过结合特定于说话人的高斯混合模型（GMM）与适用于说话人的基于音节的HMM来提出一种新的独立于文本的说话人识别方法[Nakagawa，S.，Zhang，W.，Takahashi， M.，2004年。通过将特定于说话人的GMM与基于说话人的基于音节的HMM相结合，实现了与文本无关的说话人识别。程序ICASSP-2004 1,81-84]。（Nakagawa et al。，2004）评估了这种说话人识别方法在近距离交谈环境中改变说话风格的鲁棒性。在本文中，我们将这种组合方法扩展到远方说话人识别，并将该方法与提出的位置相关的CMN进行集成。我们的实验表明，该方法在较远的环境下可以显着提高说话人的识别性能。

著录项

来源
《Speech Communication》 |2007年第6期|p. 501-513|共13页
作者
Longbiao Wang; Norihide Kitaoka; Seiichi Nakagawa;
展开▼
作者单位

Department of Information and Computer Sciences, Toyohashi University of Technology, 1-1, Hibarigaoka, Tempaku-cho, Toyohashi, Aichi 441-8580, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类语言、文字;
关键词
distant speaker recognition; GMM; HMM; position-dependent CMN; sound source estimation;

机译：远方说话人识别GMM HMM位置依赖的CMN声源估计;

相似文献

外文文献
中文文献
专利

1. Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM [J] . Seiichi NAKAGAWA, Wei ZHANG, Mitsuo TAKAHASHI IEICE Transactions on Information and Systems . 2006,第3期

机译：通过结合特定于说话人的GMM和基于说话人的基于音节的HMM来实现与文本无关/提示文字的说话人识别
2. Robust Distant Speech Recognition by Combining Variable-trem spectrum Based Position-dependent CMN with Conventional CMN [J] . Longbiao WANG, Seiichi NAKAGAWA, Norihide KITAOKA 電子情報通信学会技術研究報告 . 2008,第551期

机译：结合基于可变频谱的位置相关CMN和常规CMN的鲁棒远程语音识别
3. Robust Distant Speech Recognition by Combining Variable-term spectrum Based Position-dependent CMN with Conventional CMN [J] . Longbiao WANG, Seiichi NAKAGAWA, Norihide KITAOKA 電子情報通信学会技術研究報告. 音声. Speech . 2007,第551期

机译：结合基于可变项频谱的位置相关CMN和常规CMN的鲁棒远程语音识别
4. Robust Distant Speech Recognition by Combining Position-Dependent CMN with Conventional CMN [C] . Longbiao Wang, Kitaoka, N., . 2007

机译：通过结合位置相关的CMN和常规CMN进行鲁棒的远程语音识别
5. GMM-based speaker recognition for mobile embedded systems. [D] . Leung, Cheung-chi. 2004

机译：用于移动嵌入式系统的基于GMM的说话者识别。
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN [O] . L. WANG, S. NAKAGAWA, N. KITAOKA 2008

机译：通过将基于短期和长期频谱的位置相关的CMN与传统CMN组合通过鲁棒语音识别
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅