
Systems and methods for multimodal multilabel tagging of video



Abstract

Multimodal multilabel tagging of video content may include labeling the video content with topical tags that are identified based on features extracted from two or more modalities of the video content. The two or more modalities may include (i) a video modality for the objects, images, and/or visual elements of the video content, (ii) a text modality for the speech, dialog, and/or text of the video content, and/or (iii) an audio modality for non-speech sounds and/or sound characteristics of the video content. Combinational multimodal multilabel tagging may include combining two or more features from the same or different modalities in order to increase the contextual understanding of the features and generate contextually relevant tags. Video content may be labeled with global tags relating to overall topics of the video content, and with different sets of local tags relating to topics at different segments of the video content.
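The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the rule table, the feature values, the `Segment` type, and the `min_share` threshold are all hypothetical, and the rule that a tag becomes global when it appears in a given share of segments is an assumption made only for illustration. Features are assumed to have been extracted already by some upstream step that is not shown.

```python
from collections import Counter
from dataclasses import dataclass, field

# A feature is a (modality, value) pair, e.g. ("video", "ball").
Feature = tuple[str, str]


@dataclass
class Segment:
    """A time span of the video with its already-extracted features."""
    start_s: float
    end_s: float
    features: set[Feature] = field(default_factory=set)


# Hypothetical combination rules: a tag applies to a segment only when
# every feature in the key is present, possibly across modalities.
COMBINATION_RULES: dict[frozenset[Feature], str] = {
    frozenset({("video", "ball"), ("audio", "crowd cheering")}): "sports",
    frozenset({("video", "ball"), ("text", "goal")}): "soccer",
    frozenset({("video", "podium"), ("text", "election")}): "politics",
}


def local_tags(segment: Segment) -> set[str]:
    """Tags whose required feature combination occurs in this segment."""
    return {
        tag
        for required, tag in COMBINATION_RULES.items()
        if required <= segment.features
    }


def global_tags(segments: list[Segment], min_share: float = 0.5) -> set[str]:
    """Tags found in at least `min_share` of the segments (assumed rule)."""
    if not segments:
        return set()
    counts = Counter(tag for seg in segments for tag in local_tags(seg))
    return {tag for tag, n in counts.items() if n / len(segments) >= min_share}


example_segments = [
    Segment(0, 30, {("video", "ball"), ("audio", "crowd cheering"), ("text", "goal")}),
    Segment(30, 60, {("video", "ball"), ("audio", "crowd cheering")}),
    Segment(60, 90, {("video", "podium"), ("text", "election")}),
]

for seg in example_segments:
    print(seg.start_s, seg.end_s, sorted(local_tags(seg)))
print("global:", sorted(global_tags(example_segments)))
```

In this toy input, a ball on its own yields no tag; it contributes to a tag only in combination with cheering or with the word "goal", which is the kind of contextual disambiguation the abstract attributes to combining features.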


Bibliographic data

Publication number: US10965999B2

Patent type:
Publication date: 2021-03-30

Original format: PDF
Applicant/Assignee: OATH INC.

Application number: US202016806544
Inventors: AASISH PAPPU; AKSHAY SONI; PALOMA DE JUAN

Filing date: 2020-03-02
Classification: H04N5/445; H04N21/8405; H04N21/845; G06K9; G06F40/117; G06F40/169
Country: US
Date added to database: 2024-06-14 21:23:27
