首页> 外国专利> IMAGE CAPTIONING USING WEAK SUPERVISION AND SEMANTIC NATURAL LANGUAGE VECTOR SPACE

IMAGE CAPTIONING USING WEAK SUPERVISION AND SEMANTIC NATURAL LANGUAGE VECTOR SPACE

机译:使用弱监督和语义自然语言向量空间进行图像捕获

摘要

#$%^&*AU2016256753A120170727.pdf#####In a digital media environment to facilitate management of image collections using one or more computing devices, a method to automatically generate image captions using weak supervision data comprising obtaining a target image for caption analysis; applying feature extraction to the target image to generate global concepts corresponding to the image; comparing the target image to images from a source of weakly annotated images to identify visually similar images; building a collection of keywords for the target image indicative of image details by extracting the keywords from the visually similar images; and supplying the collection of keywords indicative of image details as the weak supervision data for caption generation along with the global concepts.Inventors: Wang et al. Title: Image Captioning with Weak Supervision 600 602 Obtain a target image for caption analysis 604 Apply feature extraction to the target image to generate global concepts corresponding to the target image 606 Compare the target image to images from a source of weakly annotated images to identify visually similar images 608 Build a collection of keywords for the target image by extracting the keywords from the visually similar images 610 Supply the collection of keywords for caption generation along with the global concepts. 612 Generate a caption for the target image using the collection of keywords to modulate word weights applied for sentence construction 7a,6
机译:#$%^&* AU2016256753A120170727.pdf #####在数字媒体环境中,以利于使用一个或多个计算设备,一种自动生成图像标题的方法使用弱监督数据,包括获得用于字幕分析的目标图像;将特征提取应用于目标图像以生成全局概念对应于图像;将目标图像与来源的图像进行比较弱注释图像以识别视觉上相似的图像;建立一个集合通过提取关键词来表示图像细节的目标图像关键词从视觉上相似的图像;并提供指示性关键字的集合图像细节作为用于字幕生成的弱监督数据,以及全球概念。发明人:Wang等。标题:具有弱监督的图像字幕600602获取目标图像以进行字幕分析604将特征提取应用于目标图像以生成与目标图像相对应的全局概念606将目标图像与来源的图像进行比较弱注释图像以标识视觉上相似的图像608通过以下方式为目标图像建立关键字集合从视觉相似的图像中提取关键字610提供用于字幕生成的关键字集合以及全球概念。612使用集合为目标图像生成标题关键字以调整应用于句子的单词权重施工7a,6

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号