The Journal of the Acoustical Society of America

A generalized smoothness criterion for acoustic-to-articulatory inversion

Abstract

The many-to-one mapping from representations in the speech articulatory space to the acoustic space renders the associated acoustic-to-articulatory inverse mapping non-unique. Among various techniques, imposing smoothness constraints on the articulator trajectories is a common way to handle this non-uniqueness, because articulators typically move smoothly during speech production. A standard smoothness constraint minimizes the energy of the difference of the articulatory position sequence, so that the articulator trajectory is smooth and low-pass in nature. Such a fixed definition of smoothness is not always realistic or adequate, because different articulators have different degrees of smoothness. In this paper, an optimization formulation that includes a generalized smoothness criterion is proposed for the inversion problem. Under this generalized setting, the smoothness parameter can be chosen for each specific articulator in a data-driven fashion. In addition, the formulation allows articulatory positions to be estimated recursively over time without any loss in performance. Experiments with the MOCHA-TIMIT database show that the articulator trajectories estimated with the generalized smoothness criterion have lower RMS error and higher correlation with the actual measured trajectories than those obtained with a fixed smoothness constraint.
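The contrast between a fixed and a generalized smoothness penalty can be illustrated with a small sketch. It is not the paper's formulation or its inversion algorithm: it only smooths a synthetic noisy trajectory by minimizing `||x - y||^2 + weight * ||P x||^2`, where `P` is either a first-difference operator (one reading of the "standard" constraint) or a matrix applying some other high-pass FIR filter `h`. The function names, the second-difference filter, the weights, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def difference_matrix(n):
    """First-difference operator D, shape (n-1, n): (D @ x)[t] = x[t+1] - x[t]."""
    return np.diff(np.eye(n), axis=0)

def filter_matrix(n, h):
    """Matrix H, shape (n-len(h)+1, n), so that H @ x == np.convolve(x, h, 'valid')."""
    h = np.asarray(h, dtype=float)
    k = len(h)
    H = np.zeros((n - k + 1, n))
    for t in range(n - k + 1):
        H[t, t:t + k] = h[::-1]  # convolution flips the filter taps
    return H

def smooth(y, penalty, weight):
    """Closed-form minimizer of ||x - y||^2 + weight * ||penalty @ x||^2."""
    n = len(y)
    A = np.eye(n) + weight * penalty.T @ penalty
    return np.linalg.solve(A, y)

def rms(a, b):
    """Root-mean-square difference between two equal-length sequences."""
    return float(np.sqrt(np.mean((a - b) ** 2)))

# Synthetic "articulator trajectory" (assumed data, not from MOCHA-TIMIT).
rng = np.random.default_rng(0)
n = 200
time = np.linspace(0.0, 1.0, n)
clean = np.sin(2 * np.pi * 2 * time)
noisy = clean + 0.2 * rng.standard_normal(n)

# Fixed criterion: penalize the energy of the first difference.
D = difference_matrix(n)
x_fixed = smooth(noisy, D, weight=10.0)

# Generalized criterion: penalize the energy of the output of another
# high-pass filter; the second difference here is a hypothetical choice.
H = filter_matrix(n, [1.0, -2.0, 1.0])
x_general = smooth(noisy, H, weight=50.0)

rms_noisy = rms(noisy, clean)
rms_fixed = rms(x_fixed, clean)
rms_general = rms(x_general, clean)
```

With `h = [1, -1]`, `filter_matrix` reproduces `difference_matrix`, so the fixed constraint is one special case of the filter-based penalty. Choosing a different `h` or `weight` per articulator is the kind of freedom the generalized criterion is meant to provide, although how the paper selects its parameter is not specified in the abstract.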
