Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

Mallikarjunan M.; Radha P. Karmali; Bharath K. P.; Muthu Rajesh Kumar

首页> 外文期刊>Circuits, systems, and signal processing >Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

【24h】

Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

机译：使用修改的VQ-LBG算法在清洁和嘈杂的背景中独立于文本的扬声器识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Speaker recognition is the process of identifying the proper speaker by analyzing the spectral shape of the speech signal. This process is done by extracting the desired features and matching the features of the speech signal. In this paper, we adopted the Mel frequency cepstrum coefficient (MFCC) technique for extracting the features from the speaker speech sample. These cepstrum coefficients are named as extracted features. The extracted MFCC features are given as input to the modified vector quantization via Linde-Buzo-Gray (modified VQ-LBG) process and expectation maximization (EM) algorithm. Vector quantization technique is mainly used for feature matching where a separate codebook will be generated for each speaker. The EM algorithm is utilized to develop the Gaussian mixture model-universal background model (GMM-UBM). In GMM-UBM model, k means cluster is summed up to consolidate data about the covariance structure of the information and the focuses of the inert Gaussians. From our analysis, the modified VQ-LBG algorithm gives better performance compared to the GMM-UBM model.

机译：扬声器识别是通过分析语音信号的光谱形状来识别适当扬声器的过程。通过提取所需特征并匹配语音信号的特征来完成该过程。在本文中，我们采用MEL频率谱系码（MFCC）技术从扬声器语音样本中提取特征。这些Cepstrum系数被命名为提取的功能。提取的MFCC特征通过Linde-Buzo-灰度（修改的VQ-LBG）处理和期望最大化（EM）算法作为改进的矢量量化的输入。向量量化技术主要用于特征匹配，其中将为每个扬声器生成单独的码本。利用EM算法开发高斯混合模型 - 通用背景模型（GMM-UBM）。在GMM-UBM模型中，k表示集群总结，以巩固关于信息协方差结构的数据和惰性高斯的焦点。从我们的分析中，与GMM-UBM模型相比，修改的VQ-LBG算法提供了更好的性能。

著录项

来源
《Circuits, systems, and signal processing》 |2019年第6期|2810-2828|共19页
作者
Mallikarjunan M.; Radha P. Karmali; Bharath K. P.; Muthu Rajesh Kumar;
展开▼
作者单位

Vellore Inst Technol Sch Elect Engn Vellore 632014 Tamil Nadu India;

Vellore Inst Technol Sch Elect Engn Vellore 632014 Tamil Nadu India;

Vellore Inst Technol Sch Elect Engn Vellore 632014 Tamil Nadu India;

Vellore Inst Technol Sch Elect Engn Vellore 632014 Tamil Nadu India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
MFCC; Modified VQ-LBG; Feature extraction; GMM-UBM;

机译：MFCC;修改的VQ-LBG;特征提取;GMM-UBM;

相似文献

外文文献
中文文献
专利

1. Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm [J] . Mallikarjunan M., Radha P. Karmali, Bharath K. P., Circuits, systems, and signal processing . 2019,第6期

机译：使用改进的VQ-LBG算法在干净和嘈杂的背景下识别与文本无关的说话人
2. ROBUST FEATURES FOR NOISY TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GFCC ALGORITHM COMBINED TO VAD AND CMN TECHNIQUES [J] . E. B. TAZI, A. BENABBOU, M. HARTI Journal of Theoretical and Applied Information Technology . 2012,第2期

机译：使用GFCC算法与VAD和CMN技术相结合的用于嘈杂的文本独立说话人的鲁棒功能
3. Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments [J] . Rania Chakroun, Mondher Frikha Multimedia Tools and Applications . 2020,第29a30期

机译：在干净和不受控制的环境中具有短语的高效文本独立扬声器识别
4. Text-independent speaker identification in noisy background [C] . XU Boling, XU Boling, ZHOU Yi International workshop on modern acoustics non-destructive evaluation (Non-destructive evaluation) . 2000

机译：在嘈杂的背景中独立于文本的扬声器识别
5. Text-independent Speaker Recognition Using Discriminative Subspace Analysis [D] . Jiang, Weiwu 2012

机译：区分子空间分析的文本无关说话人识别
6. Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles [O] . Soo Jin Park, Gary Yeung, Neda Vesselinova, -1

机译：旨在理解人和机器中说话者的辨别能力以实现不同语音风格的与文本无关的简短发声
7. Robust Text-independent Speaker Recognition with Short Utterance in Noisy Environment Using SVD as a Matching Measure [O] . Aldhaheri Rabah W., Al-Saadi Fuad E. 2004

机译：使用SVD作为匹配措施的嘈杂环境中具有短说话时间的鲁棒文本无关的说话人识别

Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅