首页> 外文会议>Text, speech and dialogue >Voice Assessment of Speakers with Laryngeal Cancer by Glottal Excitation Modeling Based on a 2-Mass Model
【24h】

Voice Assessment of Speakers with Laryngeal Cancer by Glottal Excitation Modeling Based on a 2-Mass Model

机译:基于2-Mass模型的声门兴奋模型对喉癌说话者的语音评估

获取原文
获取原文并翻译 | 示例

摘要

The paper investigates the automatic evaluation of voice-related criteria of speakers with laryngeal cancer using a parametric two-mass model of the glottis. In contrast to previous approaches based on automatic speech recognition, the proposed method allows for a distinct evaluation of voice parameters alone since the underlying feature extraction technologies are based on a modeling of the whole vocal tract. This work focuses on the separation of vocal folds and vocal tract by LPC, where the vocal folds are represented by a parametric two-mass model which characterizes the excitation signal. The model parameters are optimized by a data-driven optimization procedure in order to fit the synthetic excitation signal to the LPC residue and the estimated pitch. We found first evidence that the computed parameters are meaningful in form of Pearson correlations between excitation signal parameters and different perceptual voice evaluation criteria in the range of r ≈ |0.7|.
机译:本文研究了使用声门的两个参数模型对喉癌说话者的语音相关标准进行自动评估。与基于自动语音识别的先前方法相比,由于基础特征提取技术基于整个声道的建模,因此所提出的方法仅允许对语音参数进行单独的评估。这项工作着重于LPC对声带和声道的分离,其中声带由表征激励信号的参数两质量模型表示。模型参数通过数据驱动的优化程序进行优化,以使合成激励信号适合LPC残差和估算的音高。我们发现了第一个证据,即在r≈| 0.7 |的范围内,所计算出的参数以激励信号参数与不同感知语音评估标准之间的皮尔逊相关性形式有意义。

著录项

  • 来源
    《Text, speech and dialogue》|2011年|p.348-355|共8页
  • 会议地点 Pilsen(CZ);Pilsen(CZ)
  • 作者单位

    Lehrstuhl fuer Informatik 5 (Mustererkennung) Friedrich-Alexander-Universitaet Erlangen-Niirnberg Martensstr. 3, 91058 Erlangen, Germany;

    Lehrstuhl fuer Informatik 5 (Mustererkennung) Friedrich-Alexander-Universitaet Erlangen-Niirnberg Martensstr. 3, 91058 Erlangen, Germany;

    Lehrstuhl fuer Informatik 5 (Mustererkennung) Friedrich-Alexander-Universitaet Erlangen-Niirnberg Martensstr. 3, 91058 Erlangen, Germany;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 人工智能理论;
  • 关键词

    glottal excitation; voice modeling; perceptual evaluation;

    机译:声门激发;语音建模;知觉评估;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号