首页> 美国卫生研究院文献>The Journal of the Acoustical Society of America >A procedure for estimating gestural scores from speech acoustics
【2h】

A procedure for estimating gestural scores from speech acoustics

机译:从语音声学估计手势分数的过程

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Speech can be represented as a constellation of constricting vocal tract actions called gestures, whose temporal patterning with respect to one another is expressed in a gestural score. Current speech datasets do not come with gestural annotation and no formal gestural annotation procedure exists at present. This paper describes an iterative analysis-by-synthesis landmark-based time-warping architecture to perform gestural annotation of natural speech. For a given utterance, the Haskins Laboratories Task Dynamics and Application (TADA) model is employed to generate a corresponding prototype gestural score. The gestural score is temporally optimized through an iterative timing-warping process such that the acoustic distance between the original and TADA-synthesized speech is minimized. This paper demonstrates that the proposed iterative approach is superior to conventional acoustically-referenced dynamic timing-warping procedures and provides reliable gestural annotation for speech datasets.
机译:语音可以表示为收缩的称为手势的声道动作的星座,其手势相对于彼此的时间模式表示。当前的语音数据集不带有手势注释,并且目前不存在正式的手势注释程序。本文描述了一种基于地标的基于时间的综合合成迭代分析,以执行自然语音的手势注释。对于给定的语音,采用了Haskins实验室的任务动态和应用(TADA)模型来生成相应的原型手势分数。通过迭代定时扭曲过程在时间上优化手势评分,以使原始语音与TADA合成语音之间的声学​​距离最小。本文证明了所提出的迭代方法优于常规的声学参考动态时序规整程序,并为语音数据集提供了可靠的手势注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号