【24h】

Hypernasality Detection Using Zero Time Windowing

机译:使用零时间窗的鼻腔检测

获取原文

摘要

The hypernasality in cleft palate speech is characterized by the presence of nasal peak in the vicinity of first formant of vowel spectrum. A high spectral resolution technique, which can resolve these two peaks, is desirable for the automatic detection of hypernasality. This work uses the zero time windowing (ZTW) technique for the hypernasality detection. In this technique, the speech signal is windowed with a highly decaying impulse-like window of approximately a pitch period size. The technique gives the instantaneous vocal tract spectrum free from the pitch and harmonics effect. The spectral resolution loss due to short windowing is restored by the successive differentiation in frequency domain. The numerator of group delay is used to resolve closely spaced nasal peak and first formant. The cepstral feature is extracted from the instantaneous spectrum and is used for the automatic detection of hypernasality using SVM classifier. The accuracy of classification are 76.51% for the vowel /a/ and 80.36% for the vowel /i/. The accuracy further increases when the proposed feature is fused at score level with the Mel-frequency cepstral coefficient (MFCC) feature.
机译:left裂语音中的鼻音过高的特征在于在元音谱的第一共振峰附近存在鼻峰。能够分辨这两个峰的高光谱分辨率技术对于自动检测鼻腔过敏是合乎需要的。这项工作使用零时窗(ZTW)技术进行鼻感检测。在这种技术中,语音信号用高度衰减的近似音调周期大小的脉冲状窗口来开窗。该技术使瞬时声道频谱不受音高和谐波影响。通过短时窗开窗造成的频谱分辨率损失可通过频域中的连续微分来恢复。群延迟分子用于解析间隔较近的鼻峰和第一共振峰。倒谱特征是从瞬时频谱中提取的,并用于使用SVM分类器自动检测鼻腔充血。元音/ a /和元音/ i /的分类准确度分别为76.51%和80.36%。当将建议的特征与Mel频率倒谱系数(MFCC)特征在得分水平上融合时,精度会进一步提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号