首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition Workshops >Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video
【24h】

Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video

机译:发现操纵视频中的视听不一致(SAVI)

获取原文

摘要

This paper is part of a larger effort to detect manipulations of video by searching for and combining the evidence of multiple types of inconsistencies between the audio and visual channels. Here, we focus on inconsistencies between the type of scenes detected in the audio and visual modalities (e.g., audio indoor, small room versus visual outdoor, urban), and inconsistencies in speaker identity tracking over a video given audio speaker features and visual face features (e.g., a voice change, but no talking face change). The scene inconsistency task was complicated by mismatches in the categories used in current visual scene and audio scene collections. To deal with this, we employed a novel semantic mapping method. The speaker identity inconsistency process was challenged by the complexity of comparing face tracks and audio speech clusters, requiring a novel method of fusing these two sources. Our progress on both tasks was demonstrated on two collections of tampered videos.
机译:本文是通过搜索并组合音频和视频通道之间多种类型不一致的证据来检测视频操纵的一项较大工作的一部分。在这里,我们着眼于在音频和视觉模态(例如,室内音频,小房间与室外视觉,城市)中检测到的场景类型之间的不一致,以及在给定音频扬声器特征和视觉面部特征的情况下,视频中扬声器身份跟踪的不一致(例如,声音发生变化,但说话的脸没有发生变化)。场景不一致任务由于当前视觉场景和音频场景集合中使用的类别不匹配而变得复杂。为了解决这个问题,我们采用了一种新颖的语义映射方法。说话人身份不一致过程受到比较面部轨迹和音频语音簇的复杂性的挑战,这需要一种融合这两种来源的新颖方法。我们在两个被篡改的视频集合中展示了我们在两项任务上的进展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号