首页> 外文会议>International Conference on Pattern Recognition >Recovering audio-to-video synchronization by audiovisual correlation analysis
【24h】

Recovering audio-to-video synchronization by audiovisual correlation analysis

机译:通过视听相关分析恢复音频到视频同步

获取原文

摘要

Audio-to-video synchronization (AV-sync) may drift and is difficult to recover without dedicated human effort. In this work, we develop an interactive method to recover the drifted AV-sync by audiovisual correlation analysis. Given a video segment, a user specifies a rough time span during which a person is speaking. Our system first detects a speaker region using face detection. It then does a two-stage search to find the optimum AV-drift that can maximize the average audiovisual correlation inside the speaker region. The correlation is evaluated using quadratic mutual information with kernel density estimation. AV-sync is finally recovered by the detected optimum AV-drift. Experimental results demonstrate the effectiveness of our method.
机译:音频到视频同步(AV-SYNC)可能会漂移,并且难以恢复,而无需专注的人力努力。在这项工作中,我们开发了通过视听相关分析来恢复漂移的AV-Sync的交互方法。给定视频段,用户指定一个人正在讲的时间跨度。我们的系统首先使用面部检测来检测扬声器区域。然后,两阶段搜索可以找到最佳的AV漂移,可以最大化扬声器区域内的平均视听相关性。使用具有核密度估计的二次互联信息来评估相关性。检测到的最佳AV漂移最终恢复AV-Sync。实验结果表明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号