Speaker identification using vector quantization and I-vector with reference to Assamese language


Abstract

This paper describes the implementation of a speaker identification system for the Assamese language. The database consists of speech samples collected from 15 speakers for ten Assamese words representing the Assamese digits from 0 (shounyo) to 9 (no). Mel Frequency Cepstral Coefficients (MFCC) are used as features. Two independent speaker identification systems are built, one using Vector Quantization (VQ) and one using the I-vector technique. The I-vector system achieves higher identification accuracy than the VQ system. For each technique, three systems are built with different feature sizes. A maximum accuracy of 92.38% is achieved using the I-vector technique with 39 MFCC features.
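The VQ approach named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the codebook size, the plain k-means training loop, the function names (`train_codebook`, `distortion`, `identify`), and the synthetic 13-dimensional vectors standing in for MFCC frames are all assumptions made for illustration. The idea shown is that each speaker gets a codebook fitted to that speaker's feature frames, and a test utterance is assigned to the speaker whose codebook gives the lowest average quantization distortion.

```python
import numpy as np


def train_codebook(features, codebook_size=4, n_iter=20, seed=0):
    """Fit a VQ codebook to (n_frames, n_dims) features with plain k-means."""
    rng = np.random.default_rng(seed)
    # Initialise codewords from randomly chosen frames (an assumed choice).
    idx = rng.choice(len(features), size=codebook_size, replace=False)
    codebook = features[idx].copy()
    for _ in range(n_iter):
        # Assign each frame to its nearest codeword (Euclidean distance).
        dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
        nearest = dists.argmin(axis=1)
        # Move each codeword to the mean of the frames assigned to it.
        for k in range(codebook_size):
            members = features[nearest == k]
            if len(members) > 0:
                codebook[k] = members.mean(axis=0)
    return codebook


def distortion(features, codebook):
    """Average distance from each frame to its nearest codeword."""
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.min(axis=1).mean()


def identify(features, codebooks):
    """Return the speaker label whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda spk: distortion(features, codebooks[spk]))


# Synthetic stand-ins for MFCC frames of two hypothetical speakers.
demo_rng = np.random.default_rng(42)
train_a = demo_rng.normal(0.0, 1.0, size=(200, 13))
train_b = demo_rng.normal(5.0, 1.0, size=(200, 13))
codebooks = {
    "speaker_a": train_codebook(train_a),
    "speaker_b": train_codebook(train_b),
}
test_utterance = demo_rng.normal(5.0, 1.0, size=(50, 13))
print(identify(test_utterance, codebooks))
```

In a real system the synthetic arrays would be replaced by MFCC frames extracted from recorded speech, and the codebook size would be tuned on held-out data.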


Bibliographic details

Source
International Conference on Wireless Communications, Signal Processing and Networking | 2017 | 726p | 5 pages
Conference location
Authors
Sruti Sruba Bharali; Sanjib Kr. Kalita
Author affiliations

Conference organizer
Original format: PDF
Text language
Chinese Library Classification: TN92-53
Keywords
Mel frequency cepstral coefficient; Speech; Hidden Markov models; Feature extraction; Vector quantization; Training; Databases


Similar literature


1. Sparse coding of i-vector/JFA latent vector over ensemble dictionaries for language identification systems [J]. Om Prakash Singh, Rohit Sinha. International Journal of Speech Technology. 2018, No. 3

2. Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases [J]. Al-Kaltakchi Musab T. S., Abdullah Mohammed A. M., Woo Wai L., Circuits, Systems and Signal Processing. 2021, No. 10

3. Comparisons of extreme learning machine and backpropagation-based i-vector approach for speaker identification [J]. Musab T. S. Al-Kaltakchi, Raid Rafi Omar Al-Nima, Mohammed A. M. Abdullah. Turkish Journal of Electrical Engineering and Computer Sciences. 2020, No. 3

4. Speaker identification using vector quantization and I-vector with reference to Assamese language [C]. Sruti Sruba Bharali, Sanjib Kr. Kalita. 2017 International Conference on Wireless Communications, Signal Processing and Networking. 2017

5. A Statistical Analysis of Speaker Dependent/Independent Pattern Congruity of Assamese and Bodo Phonemes [D]. Choudhury, Sangita. 2004

6. Reference Ranges for Serum Uric Acid among Healthy Assamese People [O]. Madhumita Das, N. C. Borah, M. Ghose. 2014

7. Speaker forensic identification using joint factor analysis and i-vector [O]. R J Rouf, D Arifianto. 2021

