Journal: Pattern Recognition Letters

Recovery of audio-to-video synchronization through analysis of cross-modality correlation


Abstract

Audio-to-video synchronization (AV-sync) may drift, and it is difficult to recover without time-consuming effort. Based on an analysis of audiovisual correlations, we developed a method for recovering drifted AV-sync in a video clip with only minor human interaction. Users only need to specify the time window for a stationary speaker. Within this time window, we search for the optimum drift that maximizes the average audiovisual correlation inside the speaker region by shifting the audio and computing the correlation for different drift hypotheses, and then recover AV-sync based on the refined optimum drift. The audiovisual correlation is analyzed by Quadratic Mutual Information with Kernel Density Estimation, which is not only robust against audiovisual changes in scale but also independent of the language. The experimental results demonstrate that our method can effectively recover audio-to-video synchronization.

A preliminary version of this work was reported at the 2008 IAPR Conference on Pattern Recognition (Liu and Sato, 2008) and won the Best Industry Related Paper Award (BIRPA).
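The drift search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions the abstract does not state. The Euclidean-distance form of Quadratic Mutual Information with Gaussian kernels is assumed, as is the kernel width `sigma`. Standardizing each series is assumed as the way to gain insensitivity to scale. The audio and each pixel in the speaker region are assumed to be reduced to one scalar feature per video frame. The names `standardize`, `qmi_ed`, and `find_drift` are hypothetical, and the refinement of the optimum drift mentioned in the abstract is not shown.

```python
import math


def standardize(xs):
    """Return a zero-mean, unit-variance copy of xs so the score ignores scale."""
    n = len(xs)
    mean = sum(xs) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in xs) / n) or 1.0
    return [(x - mean) / std for x in xs]


def qmi_ed(xs, ys, sigma=0.5):
    """Euclidean-distance QMI of two 1-D series, estimated with Gaussian kernels.

    Assumption: this QMI variant and sigma are illustrative choices.
    """
    n = len(xs)
    xs, ys = standardize(xs), standardize(ys)
    # Pairwise kernel values. Constant factors are dropped because they
    # only rescale the score and do not change which drift wins.
    kx = [[math.exp(-((a - b) ** 2) / (4 * sigma ** 2)) for b in xs] for a in xs]
    ky = [[math.exp(-((a - b) ** 2) / (4 * sigma ** 2)) for b in ys] for a in ys]
    row_x = [sum(row) / n for row in kx]
    row_y = [sum(row) / n for row in ky]
    v_joint = sum(kx[i][j] * ky[i][j] for i in range(n) for j in range(n)) / n ** 2
    v_cross = sum(a * b for a, b in zip(row_x, row_y)) / n
    v_marg = (sum(row_x) / n) * (sum(row_y) / n)
    return v_joint - 2 * v_cross + v_marg


def find_drift(audio, pixel_series, max_shift, sigma=0.5):
    """Test every drift hypothesis in [-max_shift, max_shift] (in frames).

    audio: one scalar audio feature per video frame (assumed representation).
    pixel_series: one per-frame series for each pixel of the speaker region.
    Returns (best_shift, scores); video frame t is paired with audio[t + shift].
    """
    n = len(audio)
    if n <= 2 * max_shift:
        raise ValueError("time window is too short for the requested max_shift")
    lo, hi = max_shift, n - max_shift  # fixed window so all hypotheses compare fairly
    scores = {}
    for shift in range(-max_shift, max_shift + 1):
        shifted_audio = audio[lo + shift:hi + shift]
        total = sum(qmi_ed(shifted_audio, p[lo:hi], sigma) for p in pixel_series)
        scores[shift] = total / len(pixel_series)  # average over the speaker region
    best_shift = max(scores, key=scores.get)
    return best_shift, scores
```

Under this sketch's sign convention, a positive `best_shift` means the audio track lags the video by that many frames. The direct pairwise computation costs quadratic time in the window length for every pixel and hypothesis, so a practical system would likely subsample pixels or frames.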
