【24h】

Comparative Study of GMM, DTW, and Ann on Thai Speaker Identification System

机译:GMM,DTW和Ann对泰国说话人识别系统的比较研究

获取原文

摘要

This paper proposes a new investigation on Gaussian mixture model (GMM) by comparing it with some preliminary experiemtns on multilayered perceptron network (MLP) with backpropagation learning algorithm (BKP) and dynamic time warping (DTW) techniques o nthai test-dependent speaker identification system. Three major identification engines are conducted on 50 speakers with isolated digits 0-9. Training and testing utterances were recorded over a five week duration. Furhtermore, three well-known speech features, namely linear predictive coding derived cepstrum (LPCC), postfiltered ceptrum (PFL), and Mel frequency cepstral coefficient (MFCC) were evaluted. From our previous experimetns, the MFCC has given the highest identification rate on DTW and MLP. Therefore, GMM with MFCC feature was experimented and attained 87.54
机译:本文通过与反向感知学习算法(BKP)和动态时间规整(DTW)技术在多层感知器网络(MLP)上的一些初步实验进行比较,提出了一种针对高斯混合模型(GMM)的新研究,该算法基于泰国基于测试的说话者识别系统。三个主要的识别引擎是针对50位扬声器(带有0-9的隔离数字)进行的。在五周的时间内记录了训练和测试的话语。此外,还评估了三个著名的语音特征,即线性预测编码衍生倒谱(LPCC),后滤波感受态(PFL)和梅尔频率倒谱系数(MFCC)。根据我们以前的经验,MFCC在DTW和MLP上的识别率最高。因此,对具有MFCC功能的GMM进行了实验,并获得了87.54

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号