【24h】

Relating Audio-Visual Events Caused by Multiple Movements: In the Case of Entire Object Movements and Sound Location Changes

机译:与多次移动引起的视听事件相关:在整个对象移动和声音位置变化的情况下

获取原文
获取原文并翻译 | 示例

摘要

Relating audio-visual events is important for constructing for an artificial intelligent system, which can acquire the audio-visual knowledge of movement through active observation without teaching. This paper proposes a method for relating multiple audiovisual events observed by a camera and a microphone according to general laws without object-specific knowledge (including the case of entire object movements and sound location changes). As corresponding cues, we use Gestalt's grouping law; simultaneity of the occurrence of the sound and the change in movement or the same motion starting, similarity of repetition between sound and movement. Based on the correlation coefficient between auditory and visual sequence, the component of frequency at sound onset is related to the short-term space-time invariants (STSTI) of movement. We experimented in the real environment and obtained satisfactory results showing the effectiveness of the proposed method.
机译:关联视听事件对于构建人工智能系统非常重要,该系统可以通过主动观察获得运动的视听知识,而无需进行教学。本文提出了一种在没有特定对象知识的情况下(包括整个对象运动和声音位置变化的情况),根据一般规律将摄像机和麦克风所观察到的多个视听事件进行关联的方法。作为相应的提示,我们使用格式塔的分组定律。声音发生的同时性和运动的变化或相同运动的开始,声音和运动之间重复的相似性。基于听觉和视觉序列之间的相关系数,声音开始时的频率分量与运动的短期时空不变性(STSTI)相关。我们在真实环境中进行了实验,并获得令人满意的结果,表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号