Acoustical segmental duration or articulatory inter-targets as an indicator of speaker specific kinematic properties

Abstract

Segmental duration is a speech parameter that is easy to measure on the acoustic signal. Recent studies have shown that segmental duration is speaker specific (Pfitzinger, 2002) and that this specificity can be exploited for automatic speaker recognition (Ferrer et al., 2003). In this report, we discuss why the temporal aspects of speech production are of interest for acoustic-to-articulatory inversion. The speaker specificity of segmental duration suggests a possible link with the kinematic and underlying biomechanical properties of individual speakers. Everything else being equal, a longer segmental duration can be regarded as the manifestation of either a longer path between two successive articulatory targets or a lower articulator speed, which in biomechanical terms suggests lower stiffness of the muscles involved. Turning to the inverse problem, the derived kinematic properties may allow us to adapt the control sequence to a specific speaker, in connection with a generic articulatory model already adapted to that speaker's morphology. Moreover, the acoustically derived biomechanical properties can provide a reasonable constraint on the possible articulatory trajectories in speech inversion. In this study, we focus on the unvoiced sibilant fricatives /s/ and /ʃ/, because their segmental duration can be measured automatically and reliably on a large speech database. We have formulated a robust, highly accurate segmentation method for this purpose.
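
The two readings of a longer duration given in the abstract can be illustrated with the elementary relation duration = path length / mean speed. The following is only a minimal sketch: the function name, the constant-mean-speed simplification, and all numeric values are illustrative assumptions, not quantities or methods taken from the paper.

```python
import math


def segment_duration(path_length_mm: float, mean_speed_mm_per_s: float) -> float:
    """Time in seconds to cover a path at a constant mean speed.

    Hypothetical helper: a deliberately simplified kinematic relation,
    not the authors' model.
    """
    if path_length_mm < 0 or mean_speed_mm_per_s <= 0:
        raise ValueError("path length must be >= 0 and mean speed must be > 0")
    return path_length_mm / mean_speed_mm_per_s


# Illustrative values only (assumed, not measured data).
baseline = segment_duration(10.0, 100.0)           # reference movement
longer_path = segment_duration(15.0, 100.0)        # same speed, longer path
slower_speed = segment_duration(10.0, 1000.0 / 15.0)  # same path, lower speed

# Both alternatives lengthen the segment by the same amount, so duration
# alone cannot tell them apart.
same_duration = math.isclose(longer_path, slower_speed)
```

Under these assumptions both alternatives yield the same lengthened duration, which is why the abstract presents them as competing explanations: separating them requires information beyond duration, such as an articulatory model already adapted to the speaker's morphology.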

Bibliographic record

  • Authors

    Toda Martine; Maeda Shinji;

  • Author affiliation
  • Year: 2012
  • Total pages
  • Original format: PDF
  • Text language: en
  • Chinese Library Classification
