Weak supervision data regarding a target image is obtained and used to provide detailed information that supplements the global image concepts derived for image captioning. Weak supervision data refers to noisy data that is not closely curated and may include errors. Given a target image, weak supervision data for visually similar images may be collected from sources of weakly annotated images, such as online social networks; images posted online generally carry "weak" annotations in the form of tags, titles, labels, and short descriptions added by users. Weak supervision data for the target image is generated by extracting keywords from the visually similar images discovered across these sources. Separate independent claims cover feature extraction, the use of convolutional neural networks (CNNs), and a semantic attention model using weighted keywords. The methods may also employ a language processing model.
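The claimed semantic attention model could be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes dot-product relevance scoring and count-based keyword weights (e.g., how often a keyword appears among the visually similar images), neither of which is specified in the abstract. All function and variable names are hypothetical.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))
    return e / e.sum()

def semantic_attention(image_feat, keyword_embs, keyword_weights):
    """Blend keyword embeddings into a context vector, attending more
    strongly to keywords that are both relevant to the global image
    feature and heavily weighted by the weak supervision data."""
    # Relevance of each keyword to the global image feature (dot product).
    scores = keyword_embs @ image_feat
    # Scale relevance by the keyword's weight (assumed here to be its
    # occurrence count among visually similar images), then normalize.
    attn = softmax(scores * keyword_weights)
    # Context vector: attention-weighted sum of keyword embeddings.
    return attn @ keyword_embs

# Toy example: a 4-dim image feature and three extracted keywords.
image_feat = np.array([0.9, 0.1, 0.0, 0.2])
keyword_embs = np.array([
    [1.0, 0.0, 0.0, 0.0],   # e.g., "beach"
    [0.0, 1.0, 0.0, 0.0],   # e.g., "sunset"
    [0.0, 0.0, 1.0, 0.0],   # e.g., "dog"
])
keyword_weights = np.array([3.0, 1.0, 1.0])  # "beach" seen in 3 similar images
context = semantic_attention(image_feat, keyword_embs, keyword_weights)
```

In a full captioning pipeline, such a context vector would be combined with the CNN image feature at each decoding step of a language model; here it simply demonstrates how keyword weights bias attention toward the best-supported concepts.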