【24h】

SPEAKER RECOGNITION USING GMM

机译:使用GMM识别扬声器

获取原文
           

摘要

The idea of the AUDIO SIGNAL PROCESSING (Speaker Recognition [4] Project) is to implement a recognizer using Matlab which can identify a person by processing his/her voice. The Matlab functions and scripts were all well documented and parameterized in order to be able to use them in the future. The basic goal of our project is to recognize and classify the speeches of different persons. This classification is mainly based on extracting several key features like Mel Frequency Cepstral Coefficients (MFCC [2]) from the speech signals of those persons by using the process of feature extraction using MATLAB. The above features may consists of pitch, amplitude, frequency etc. It can be achieved by using tools like MATLAB. Using a statistical model like Gaussian mixture model (GMM [6]) and features extracted from those speech signals we build a unique identity for each person who enrolled for speaker recognition [4]. Estimation and Maximization algorithm is used, An elegant and powerful method for finding the maximum likelihood solution for a model with latent variables, to test the later speeches against the database of all speakers who enrolled in the database.
机译:音频信号处理(扬声器识别[4]项目)的思想是使用Matlab实现识别器,该识别器可以通过处理人的语音来识别人。 Matlab的函数和脚本都经过了很好的文档编制和参数设置,以便将来可以使用。我们项目的基本目标是识别和分类不同人的讲话。该分类主要基于通过使用MATLAB进行特征提取的过程从这些人的语音信号中提取几个关键特征,如梅尔频率倒谱系数(MFCC [2])。以上功能可能包括音调,幅度,频率等。可以通过使用MATLAB之类的工具来实现。使用像高斯混合模型(GMM [6])这样的统计模型和从这些语音信号中提取的特征,我们为注册说话者识别的每个人建立唯一的身份[4]。使用估计和最大化算法。一种优雅而强大的方法,用于为具有潜在变量的模型寻找最大似然解,以针对所有加入该数据库的说话者的数据库测试后来的讲话。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号