Video retrieval based on encoding temporal relationships among video frames

Abstract

Systems and methods for content-based video retrieval are described. The systems and methods may break a video into multiple frames, generate a feature vector from the frames based on the temporal relationships among them, and then embed the feature vector into a vector space along with a vector representing a search query. In some embodiments, the video feature vector is converted into a text caption prior to the embedding. In other embodiments, the video feature vector and a sentence vector are each embedded into a common space using a joint video-sentence embedding model. Once the video and the search query are embedded into a common vector space, a distance between them may be calculated. After the distances between the search query and a set of videos are calculated, they may be used to select a subset of the videos to present as the search result.
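The retrieval flow in the abstract (frames, then a temporally aware feature vector, then a distance in a common space, then a selected subset) can be sketched in a few lines. This is only an illustration under stated assumptions, not the patented method: `temporal_feature` is a hypothetical stand-in for the learned encoder (it concatenates the mean frame vector with the mean frame-to-frame difference), the query vector is assumed to be already embedded in the same space, and cosine distance is one possible choice of distance.

```python
import math
from typing import Dict, List, Sequence

Vector = List[float]


def temporal_feature(frames: Sequence[Vector]) -> Vector:
    """Hypothetical stand-in for a learned temporal encoder.

    Concatenates the mean frame vector with the mean difference between
    consecutive frames, so frame order affects the result.
    """
    if not frames:
        raise ValueError("need at least one frame")
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[i] for f in frames) / n for i in range(dim)]
    if n == 1:
        delta = [0.0] * dim
    else:
        delta = [
            sum(frames[t + 1][i] - frames[t][i] for t in range(n - 1)) / (n - 1)
            for i in range(dim)
        ]
    return mean + delta


def cosine_distance(a: Vector, b: Vector) -> float:
    """Cosine distance; assumed here, the source does not name a metric."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def retrieve(query: Vector, videos: Dict[str, Vector], k: int) -> List[str]:
    """Return the ids of the k videos closest to the query vector."""
    ranked = sorted(videos, key=lambda vid: cosine_distance(query, videos[vid]))
    return ranked[:k]


# Two toy videos with the same frames in opposite order.
videos = {
    "rising": temporal_feature([[0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]),
    "falling": temporal_feature([[2.0, 1.0], [1.0, 1.0], [0.0, 1.0]]),
}
query = [1.0, 1.0, 1.0, 0.0]  # assumed to be pre-embedded in the same space
print(retrieve(query, videos, k=1))  # prints ['rising']
```

Both toy videos have the same average frame, so an order-blind feature could not tell them apart; only the difference term, which depends on frame order, separates them. In a real system the encoder and the query embedding would be learned models rather than these hand-written functions.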

Bibliographic details

Publication number: US11238093B2

Patent type:
Publication date: 2022-02-01

Original format: PDF
Applicant/Patentee: ADOBE INC.

Application number: US201916601773
Inventors: KUMAR AYUSH; HARNISH LAKHANI; ATISHAY JAIN

Filing date: 2019-10-15
Classification: G06F16/732; G06N3/08; H04N19/59; G06F16/74
Country: US
Database entry time: 2022-08-24 23:35:05
