Journal: The Journal of the Acoustical Society of America

SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals


Abstract

A new method, "simultaneous inverse filtering and model matching" (SIM), is proposed that allows one to calculate voice source measures without any user interaction. It is based on the discrete all-pole modeling (DAP) technique for inverse filtering (IF), which is modified to include a model of the glottal flow as an integral part [LF model, Fant et al., STL-QPSR (Stockholm) 4/1985, 1-13 (1986)]. As the correct LF parameters are initially unknown, they are estimated in an iterative procedure using multidimensional optimization techniques that are initialized according to the results of an exhaustive search. The error criteria applied reflect how well the IF performs after the spectral contribution of the glottal flow has been removed. The resulting optimal LF parameter constellation serves as the basis for calculating 11 voice source measures. The performance was evaluated using synthesized signals and recordings of natural utterances. For the synthesized signals, the original parameters were reproduced with high accuracy (correlations exceeding 0.88) for those measures into which the starting point of the glottal cycle did not enter explicitly. Errors were smaller than with conventional estimation methods, in which the measures are estimated from the IF signal. The analysis of natural utterances indicates that problems with robustness remain, but that under advantageous conditions the open quotient, the speed quotient, the closing quotient, the parabolic spectral parameter, and the negative peak amplitude of the glottal flow derivative can indeed be determined automatically by the SIM method.
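The two-stage estimation described in the abstract (an exhaustive search that initializes an iterative multidimensional optimization) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the error function `toy_error` is a hypothetical stand-in for the inverse-filtering error criteria, the three parameters and their grid values are invented, and the compass (pattern) search is a generic derivative-free optimizer, not necessarily the technique used by the authors.

```python
import itertools


def grid_search(error_fn, axes):
    """Exhaustive search: evaluate error_fn on every grid point, keep the best."""
    return min(itertools.product(*axes), key=error_fn)


def compass_search(error_fn, start, step=0.1, tol=1e-6, max_iter=10_000):
    """Derivative-free local refinement (compass / pattern search)."""
    x = list(start)
    fx = error_fn(x)
    for _ in range(max_iter):
        if step < tol:
            break
        improved = False
        for i in range(len(x)):
            for delta in (step, -step):
                trial = x.copy()
                trial[i] += delta
                ft = error_fn(trial)
                if ft < fx:
                    x, fx, improved = trial, ft, True
        if not improved:
            step *= 0.5  # no better neighbour: shrink the search stencil
    return x, fx


# Hypothetical stand-in for the inverse-filtering error criterion:
# a smooth bowl whose minimum lies at TRUE_PARAMS (invented values).
TRUE_PARAMS = (0.43, 0.57, 0.04)


def toy_error(params):
    return sum((p - t) ** 2 for p, t in zip(params, TRUE_PARAMS))


# Coarse candidate values for three hypothetical shape parameters.
axes = [
    [0.3, 0.4, 0.5],
    [0.5, 0.6, 0.7],
    [0.01, 0.05, 0.10],
]

start = grid_search(toy_error, axes)          # stage 1: exhaustive search
best, err = compass_search(toy_error, start)  # stage 2: iterative refinement
print("grid start:", start)
print("refined   :", [round(v, 4) for v in best], "error:", err)
```

The coarse grid only needs to land in the right basin of the error surface; the local search then refines the estimate. In a real setting the error function would be far more expensive to evaluate and may have several local minima, which is the usual motivation for initializing from an exhaustive search rather than from a fixed guess.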
