Speaker identification using vector quantization and I-vector with reference to Assamese language

机译：参照阿萨姆语使用矢量量化和I矢量进行说话人识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes the implementation of a speaker identification system with reference to Assamese language. The database consists of speech samples that were collected from 15 (fifteen) speakers for ten Assamese words representing the Assamese digits from 0 (shounyo) to 9 (no). Mel Frequency Cepstral Coefficients (MFCC) are used as features for this study. Two independent speaker identification systems have been built in this paper using Vector Quantization (VQ) and I-vector technique. The system built using the I-vector technique obtains comparatively better identification accuracy for speakers when compared with the system developed using VQ technique. Three different systems have been built for both the techniques based on variable feature size. A maximum accuracy of 92.38% is achieved using I-vector technique with 39 MFCC features.

机译：本文介绍了参考阿萨姆语的说话人识别系统的实现。该数据库由语音样本组成，这些语音样本是从15位（十五位）说话者收集的10个阿萨姆语单词组成的，这些单词代表从0（shounyo）到9（no）的阿萨姆语数字。梅尔频率倒谱系数（MFCC）被用作本研究的特征。本文使用矢量量化（VQ）和I矢量技术建立了两个独立的说话人识别系统。与使用VQ技术开发的系统相比，使用I矢量技术构建的系统可获得相对更好的说话人识别精度。基于可变特征尺寸，针对这两种技术已经构建了三个不同的系统。使用具有39个MFCC功能的I矢量技术，可以达到92.38％的最大精度。

著录项

来源
《2017 International Conference on Wireless Communications, Signal Processing and Networking》|2017年|164-168|共5页
会议地点 Chennai(IN)
作者
Sruti Sruba Bharali; Sanjib Kr. Kalita;
展开▼
作者单位

Gauhati University, Guwahati, India;

Gauhati University, Guwahati, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Mel frequency cepstral coefficient; Speech; Hidden Markov models; Feature extraction; Vector quantization; Training; Databases;

机译：梅尔频率倒谱系数;语音;隐马尔可夫模型;特征提取;矢量量化;训练;数据库;;

相似文献

外文文献
中文文献
专利

1. Sparse coding of i-vector/JFA latent vector over ensemble dictionaries for language identification systems [J] . Om Prakash Singh, Rohit Sinha International journal of speech technology . 2018,第3期

机译：用于语言识别系统的集成字典上i-vector / JFA潜在向量的稀疏编码
2. Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases [J] . Al-Kaltakchi Musab T. S., Abdullah Mohammed A. M., Woo Wai L., Circuits, systems and signal processing . 2021,第10期

机译：COMBER 2008，NIST 2008，TIMIT数据库的强大扬声器识别与评估的I-Vector和Extreme Learning Machine方法
3. Comparisons of extreme learning machine and backpropagation-based i-vector approach for speaker identification [J] . Musab T S AL-KALTAKCHI, Ra?d Raf? Omar AL-NIMA, Mohammed A M ABDULLAH Turkish Journal of Electrical Engineering and Computer Sciences . 2020,第3期

机译：扬声器识别极限基于极端学习机和基于BackProjagation的I形载方法的比较
4. Speaker identification using vector quantization and I-vector with reference to Assamese language [C] . Sruti Sruba Bharali, Sanjib Kr. Kalita International Conference on Wireless Communications, Signal Processing and Networking . 2017

机译：使用矢量量化和I形式的扬声器识别，参考assameese语言
5. A Statistical Analysis of Speaker Dependent/Independent Pattern Congruity of Assamese and Bodo Phonemes. [D] . Choudhury, Sangita. 2004

机译：阿萨姆语和博多音素的说话人依存/独立模式一致性的统计分析。
6. Reference Ranges for Serum Uric Acid among Healthy Assamese People [O] . Madhumita Das, N. C. Borah, M. Ghose, 2014

机译：健康的阿萨姆人血清尿酸参考范围
7. Speaker forensic identification using joint factor analysis and i-vector [O] . R J Rouf, D Arifianto 2021

机译：使用联合因子分析和I形载体的扬声器法医识别

Speaker identification using vector quantization and I-vector with reference to Assamese language

摘要

著录项

相似文献

相关主题

期刊订阅