首页> 外国专利> WEAKLY-SUPERVISED TEXT-BASED VIDEO MOMENT RETRIEVAL VIA CROSS ATTENTION MODELING

WEAKLY-SUPERVISED TEXT-BASED VIDEO MOMENT RETRIEVAL VIA CROSS ATTENTION MODELING

机译：基于弱监督的基于文本的视频时刻通过跨关注建模检索

页面导航

摘要
著录项
相似文献

摘要

An electronic device obtains video content and a textual query associated with a video moment in the video content. The video content is divided video segments, and the textual query includes one or more words. Visual features are extracted for each video segment, and textual features are extracted for each word. The visual features and the textual features are combined to generate a similarity matrix in which each element represents a similarity level between a respective video segment and a respective word. Segment-attended sentence features are generated for the textual query based on the textual features and the similarity matrix. The segment-attended sentence features are combined with the visual features of the video segments to determine a plurality of alignment scores, which is used to retrieve a subset of the video content associated with the textual query to be retrieved from the video segments.

机译：电子设备获得视频内容和与视频内容中的视频时刻相关联的文本查询。视频内容是划分视频段，文本查询包含一个或多个单词。为每个视频段提取可视特征，为每个单词提取文本特征。组合可视特征和文本特征以生成相似性矩阵，其中每个元素表示相应的视频段和相应字之间的相似度。基于文本特征和相似性矩阵，为文本查询生成段出现的句子功能。分段句子特征与视频段的视觉特征组合以确定多个对准分数，其用于从视频段检索与要检索的文本查询相关联的视频内容的子集。

著录项

公开/公告号WO2021092632A2

专利类型
公开/公告日2021-05-14

原文格式PDF
申请/专利权人 INNOPEAK TECHNOLOGY INC.;
展开▼

申请/专利号USUS2021/019817
发明设计人 CHEN JIAWEI;HSIAO JENHAO;
展开▼

申请日2021-02-26
分类号G06F16/783;
国家 US
入库时间 2022-08-24 18:41:50

相似文献

专利
外文文献
中文文献