Conference: International Conference on Electrical, Computer and Communication Engineering

Effects of Different coefficients on MFCC and PLP for Bangla Speech Corpus using Tied-state Triphone Model

Abstract

This paper examines how the number and type of coefficients affect MFCC and PLP feature extraction for a Bangla speech corpus. We first observed the effect of using 12 coefficients for every 10 ms frame, and then added the delta and acceleration coefficients to obtain vectors of 24 and 36 coefficients per frame, respectively. We then observed the effect of appending the power coefficient and its first and second derivatives, which gives a 39-coefficient feature vector per frame. In addition, we appended 13 third differential coefficients to form a vector of 52 coefficients per frame, in order to observe the effect of the third differential coefficients as well. The experimental results show that, for gender-unbiased models, adding the delta coefficients gave the highest detection for both speaker-dependent and speaker-independent systems. For speaker-independent gender-biased models, however, adding the acceleration, power, and third differential coefficients increased detection for both MFCC and PLP on noise-free audio samples with a sampling rate of 44.1 kHz.
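
The vector sizes quoted in the abstract (12, 24, 36, 39, and 52 coefficients per frame) can be reproduced with a minimal sketch like the one below. It is only an illustration of the bookkeeping, not the authors' pipeline: the random arrays stand in for real MFCC or PLP output, and the regression-style derivative with a window of 2 frames, the edge padding, and the helper names `delta` and `stack_features` are all assumptions, since the abstract does not say how the derivatives were computed.

```python
import numpy as np


def delta(features: np.ndarray, window: int = 2) -> np.ndarray:
    """Regression-based time derivative over +/- `window` frames.

    Assumption: a common regression formula with edge frames repeated
    as padding; the abstract does not specify the method used.
    """
    num_frames = features.shape[0]
    padded = np.pad(features, ((window, window), (0, 0)), mode="edge")
    denom = 2 * sum(k * k for k in range(1, window + 1))
    out = np.zeros(features.shape, dtype=float)
    for k in range(1, window + 1):
        ahead = padded[window + k : window + k + num_frames]
        behind = padded[window - k : window - k + num_frames]
        out += k * (ahead - behind)
    return out / denom


def stack_features(static: np.ndarray, order: int) -> np.ndarray:
    """Append derivatives of the static features up to the given order."""
    blocks = [static.astype(float)]
    for _ in range(order):
        blocks.append(delta(blocks[-1]))
    return np.hstack(blocks)


# Placeholder data: 100 frames (one per 10 ms), not real speech features.
rng = np.random.default_rng(0)
num_frames = 100
cepstra = rng.standard_normal((num_frames, 12))  # 12 MFCC or PLP coefficients
power = rng.standard_normal((num_frames, 1))     # per-frame power term
with_power = np.hstack([cepstra, power])         # 13 static values per frame

dims = {
    "static": stack_features(cepstra, 0).shape[1],
    "static+delta": stack_features(cepstra, 1).shape[1],
    "static+delta+accel": stack_features(cepstra, 2).shape[1],
    "with power, up to accel": stack_features(with_power, 2).shape[1],
    "with power, up to third": stack_features(with_power, 3).shape[1],
}
for name, size in dims.items():
    print(f"{name}: {size} coefficients per frame")
```

Each derivative order adds one more block the same width as the static block, which is why appending the power term (13 static values) yields 39 coefficients at second order and 52 once the 13 third differential coefficients are included.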