首页> 外文会议>International Conference on Advanced Computing and Communication Systems >Detection and Recognition of Objects in Image Caption Generator System: A Deep Learning Approach
【24h】

Detection and Recognition of Objects in Image Caption Generator System: A Deep Learning Approach

机译:图像字幕生成器系统中对象的检测和识别:一种深度学习方法

获取原文

摘要

Image Caption Generator deals with generating captions for a given image. The semantic meaning in the image is captured and converted into a natural language. The capturing mechanism involves a tedious task that collaborates both image processing and computer vision. The mechanism must detect and establish relationships between objects, people, and animals. The aim of this paper is to detect, recognize and generate worthwhile captions for a given image using deep learning. Regional Object Detector (RODe) is used for the detection, recognition and generating captions. The proposed method focuses on deep learning to further improve upon the existing image caption generator system. Experiments are conducted on the Flickr 8k dataset using python language to demonstrate the proposed method.
机译:图像标题生成器处理为给定图像生成标题。图像中的语义被捕获并转换为自然语言。捕获机制涉及一个繁琐的任务,需要将图像处理和计算机视觉协作。该机制必须检测并建立对象,人和动物之间的关系。本文的目的是使用深度学习为给定图像检测,识别并生成有价值的标题。区域对象检测器(RODe)用于检测,识别和生成字幕。所提出的方法着重于深度学习,以进一步改进现有的图像字幕生成器系统。使用python语言对Flickr 8k数据集进行了实验,以证明所提出的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号