Correlation based speech-video synchronization

Abstract

This paper presents a novel lip synchronization technique that exploits the correlation between speech and lip movements. First, the speech signal is represented by a nonlinear time-varying model consisting of a sum of AM-FM signals, each of which models a single formant frequency. The model is realized using a Taylor series expansion in a way that relates the lip shape (width and height) to the speech amplitude and instantaneous frequency. From the lip width and height, a semi-speech signal is generated and correlated with the original speech signal over a span of delays, and the delay between the speech and the video is then estimated. Using real and noisy data from the VidTimit and in-house datasets, the proposed method was able to estimate small delays of 0.01-0.1 s in the noise-free and noisy cases, respectively, with a maximum absolute error of 0.0022 s.
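The final step of the abstract, correlating two signals over a span of candidate delays and taking the best match, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `estimate_delay`, the normalized-correlation score, the 1 kHz sample rate, and the synthetic random signals standing in for the speech and semi-speech signals are all assumptions made for illustration.

```python
import numpy as np


def estimate_delay(reference, delayed, sample_rate, max_delay_s):
    """Return the delay (in seconds) of `delayed` relative to `reference`.

    Hypothetical helper: scans integer lags in [-max_delay_s, +max_delay_s]
    and picks the lag with the highest normalized correlation.
    """
    ref = np.asarray(reference, dtype=float)
    sig = np.asarray(delayed, dtype=float)
    ref = ref - ref.mean()
    sig = sig - sig.mean()

    max_lag = int(round(max_delay_s * sample_rate))
    lags = np.arange(-max_lag, max_lag + 1)
    scores = np.empty(lags.size)

    for i, lag in enumerate(lags):
        # Align the overlapping parts of the two signals for this lag.
        if lag >= 0:
            a, b = ref[: ref.size - lag], sig[lag:]
        else:
            a, b = ref[-lag:], sig[: sig.size + lag]
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        scores[i] = np.dot(a, b) / denom if denom > 0 else 0.0

    return lags[np.argmax(scores)] / sample_rate


# Synthetic stand-ins (assumed): a random "speech" signal and a noisy copy
# delayed by 50 samples, i.e. 0.05 s at the assumed 1 kHz sample rate.
rng = np.random.default_rng(0)
sample_rate = 1000
speech = rng.standard_normal(2000)
true_lag = 50
semi_speech = np.concatenate([np.zeros(true_lag), speech[:-true_lag]])
semi_speech = semi_speech + 0.3 * rng.standard_normal(semi_speech.size)

estimated = estimate_delay(speech, semi_speech, sample_rate, max_delay_s=0.1)
print(f"estimated delay: {estimated:.3f} s")
```

A positive result means the second signal lags the first. In this sketch the resolution is limited to one sample period; the abstract does not state how the paper handles resolution or noise, so those details are left out here.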

Bibliographic details

  • Source
    Pattern Recognition Letters | 2011, Issue 6 | pp. 780-786 | 7 pages
  • Author affiliations

    School of Electrical, Electronic and Computer Engineering, The University of Western Australia, 35 Stirling Highway, Crawley, WA 6009, Australia; School of Computer Science and Software Engineering, The University of Western Australia, 35 Stirling Highway, Crawley, WA 6009, Australia

    School of Computer Science and Software Engineering, The University of Western Australia, 35 Stirling Highway, Crawley, WA 6009, Australia

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language of text: English
  • Keywords

    correlation; lip sync; formants; estimation; AM; FM
