首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >SPEECH MODELING AND VOICED/UNVOICED/MIXED/SILENCE SPEECH SEGMENTATION WITH FRACTIONALLY GAUSSIAN NOISE BASED MODELS
【24h】

SPEECH MODELING AND VOICED/UNVOICED/MIXED/SILENCE SPEECH SEGMENTATION WITH FRACTIONALLY GAUSSIAN NOISE BASED MODELS

机译:用分馏高斯噪声模型进行语音建模和浊音/无声/混合/沉默语音分割

获取原文

摘要

The ARMA filtered fractionally differenced Gaussian Noise (FdGn) model and a new AR Filtered FdGn Added up model are applied to speech signal and performance of their parameters on speech Unvoiced/Voiced/Mixed/Silence classification is evaluated against Zero Crossing Rate (ZCR) feature. For parameter estimation of AR filtered FdGn model two methods were applied: iterative Maximum Likelihood (ML) method of Tewfik [2] and a new computationally efficient Linear Minimum Square Error (LMSE) algorithm Also for parameters estimation of new Added up model two approaches were implemented: an Expectation-Maximization (EM) based approach and an iterative MSE approach. The described models and methods were applied to speech signal and also its real Cepstrum. The performance of described models on V/U/M/S speech classification was obtained based on J1 parameter in this order: Added up model on real Cepstrum of speech, Filtered FdGn model on real Cepstrum of speech (LMSE method), Filtered FdGn model on speech (LMSE method), ZCR, and Filtered FdGn model on speech (Tewfik method).
机译:过滤分数求差高斯噪声(FdGn)模式和全新的AR过滤FdGn加起来模型ARMA应用于语音信号以及它们对语音参数清音/浊音/混合/静音分类是对过零率计算的性能(ZCR)功能。对于AR的参数估计过滤FdGn模型施加两种方法:Tewfik的迭代最大似然(ML)方法[2]和新的计算上高效的线性最小均方误差(LMSE)算法也用于参数的新累加模型两种方法是估计实现:一个期望最大化(EM)为基础的方法和迭代MSE的方法。所描述的模型和方法应用于语音信号,并其真正的频谱。上U / V描述模型的性能/基于以该顺序J1参数得到M / S语音分类:增加了模型上的语音的真实倒谱,对语音的真实倒谱(LMSE方法),过滤FdGn模型滤波FdGn模型对语音(LMSE方法),ZCR,并过滤FdGn模型对语音(Tewfik方法)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号