首页> 外文会议>International Conference on Nascent Technologies in the Engineering Field >Comparison of vocal tract shape estimation techniques based on formant frequencies, autocorrelation, covariance and lattice
【24h】

Comparison of vocal tract shape estimation techniques based on formant frequencies, autocorrelation, covariance and lattice

机译:基于共振峰频率,自相关,协方差和晶格的声道形状估计技术的比较

获取原文

摘要

Vocal tract is one of most important system in speech production and it begins at the glottis and ends at the lips. Vocal tract shape (VTS) is defined as varying cross sectional area from glottis-to-lips. Based on literature review it is noted that most of the research work carried out on vocal tract shape estimation (VTSE) is based on Wakita's algorithm which is based on autocorrelation of speech. The objective of this research work is to investigate VTSE based on formant frequencies, autocorrelation, covariance and lattice methods. For validation of results, data available for vocal tract shape for vowels from Magnetic Resonance Imaging (MRI) technique was used. Vowels /a/, /i/, /u/, /o/, vowel-semivowel-vowel utterances /aya/, /awa/ and some VCV syllables /apa/, /uba/ were analyzed for three female and three male speakers. From formant frequency, autocorrelation, covariance and lattice methods satisfactory results were obtained for vowels and semivowels. However, VTS for vowels based on formant frequency technique when compared with the MRI shapes were more realistic. From the investigation for effect of variation in analysis frame length on VTSE, it was observed that, lattice method required minimum analysis frame length compared to autocorrelation, and covariance methods, and estimated areas were more consistent across the analysis frames compared to other methods.
机译:声道是语音产生中最重要的系统之一,它始于声门,止于嘴唇。声道形状(VTS)定义为从声门到嘴唇的横截面积变化。基于文献综述,注意到对声道形状估计(VTSE)进行的大部分研究工作都是基于Wakita算法,该算法基于语音的自相关。这项研究工作的目的是基于共振峰频率,自相关,协方差和晶格方法研究VTSE。为了验证结果,使用了可从磁共振成像(MRI)技术获得的元音的声道形状数据。分析了三位女性和三位男性说话者的元音/ a /,/ i /,/ u /,/ o /,元音-半音-元音/ aya /,/ awa /和一些VCV音节/ apa /,/ uba / 。从共振峰频率,自相关,协方差和晶格方法获得了元音和半元音的满意结果。但是,与共振共振峰形状相比,基于共振峰频率技术的元音的VTS更现实。从对分析帧长度变化对VTSE的影响的调查中可以看出,与自相关和协方差方法相比,晶格方法需要最小的分析帧长度,并且与其他方法相比,整个分析帧中的估计面积更加一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号