首页>
外国专利>
Multi-model techniques to generate video metadata
Multi-model techniques to generate video metadata
展开▼
机译:生成视频元数据的多模型技术
展开▼
页面导航
摘要
著录项
相似文献
摘要
A metadata generation system utilizes machine learning techniques to accurately describe content of videos based on multi-model predictions. In some embodiments, multiple feature sets are extracted from a video, including feature sets showing correlations between additional features of the video. The feature sets are provided to a learnable pooling layer with multiple modeling techniques, which generates, for each of the feature sets, a multi-model content prediction. In some cases, the multi-model predictions are consolidated into a combined prediction. Keywords describing the content of the video are determined based on the multi-model predictions (or combined prediction). An augmented video is generated with metadata that is based on the keywords.
展开▼