首页>
外国专利>
WEAKLY-SUPERVISED TEXT-BASED VIDEO MOMENT RETRIEVAL
WEAKLY-SUPERVISED TEXT-BASED VIDEO MOMENT RETRIEVAL
展开▼
机译:基于弱监督的基于文本的视频时刻检索
展开▼
页面导航
摘要
著录项
相似文献
摘要
This application is directed to retrieving a video moment based on text description. An electronic device obtains video content and text description associated with the video moment. The video content includes a plurality of video segments, and the text description including one or more sentences. A plurality of visual features are extracted for the video segments of the video content, and one or more textual features are extracted for the one or more sentences in the text description. The visual features of the plurality of video segments and the textual features of the one or more sentences are combined to generate a plurality of alignment scores. Based on the alignment scores, the electronic device retrieves a subset of the video content from the video segments for the text description.
展开▼