SPATIO-TEMPORAL INTERACTION NETWORK FOR LEARNING OBJECT INTERACTIONS

Abstract

Systems and methods for improving video understanding tasks based on higher-order object interactions (HOIs) between object features are provided. A plurality of frames of a video are obtained. A coarse-grained feature representation is generated by generating an image feature for each of a plurality of timesteps respectively corresponding to each of the frames and performing attention based on the image features. A fine-grained feature representation is generated by generating an object feature for each of the plurality of timesteps and generating the HOIs between the object features. The coarse-grained and the fine-grained feature representations are concatenated to generate a concatenated feature representation.
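The two-branch data flow described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the function names, the linear scoring vector `w_att` used for temporal attention, and the use of an element-wise product over object pairs as a simple stand-in for the HOIs, whose actual formulation the abstract does not specify.

```python
import math
from itertools import combinations


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def coarse_grained(image_feats, w_att):
    """Attention-weighted sum of per-timestep image features.

    image_feats: list of T feature vectors, one per frame.
    w_att: hypothetical scoring vector (assumption, not from the source).
    """
    alpha = softmax([dot(f, w_att) for f in image_feats])
    dim = len(image_feats[0])
    return [sum(a * f[d] for a, f in zip(alpha, image_feats)) for d in range(dim)]


def fine_grained(object_feats):
    """Average pairwise interaction between object features.

    object_feats[t] is the list of object feature vectors at timestep t.
    The element-wise product of each object pair is an assumed stand-in
    for the HOIs. Requires at least one timestep with two or more objects.
    """
    dim = len(object_feats[0][0])
    total = [0.0] * dim
    count = 0
    for objs in object_feats:
        for a, b in combinations(objs, 2):
            for d in range(dim):
                total[d] += a[d] * b[d]
            count += 1
    return [x / count for x in total]


def video_representation(image_feats, object_feats, w_att):
    """Concatenate the coarse-grained and fine-grained representations."""
    return coarse_grained(image_feats, w_att) + fine_grained(object_feats)


# Toy input: 2 timesteps, 2-dimensional features, 2 objects per frame.
image_feats = [[1.0, 0.0], [0.0, 1.0]]
object_feats = [[[1.0, 2.0], [3.0, 4.0]], [[1.0, 1.0], [1.0, 1.0]]]
w_att = [0.0, 0.0]  # zero scores give uniform attention over timesteps

rep = video_representation(image_feats, object_feats, w_att)
print(rep)  # [0.5, 0.5, 2.0, 4.5]
```

The first two entries of `rep` come from the coarse-grained branch and the last two from the fine-grained branch, so the concatenated length is the sum of the two branch dimensions.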