【24h】

Image Captioning Methods and Metrics

机译:图像标题方法和指标

获取原文

摘要

Image Captioning is one of the emerging topics of research in the field of AI. It uses a combination of Computer Vision (CV) and Natural Language Processing (NLP) to derive features from the image, use this information to identify objects, actions, their relationships, and generate a description for the image. It is most important concept in artificial intelligence applied in the fields like aid to the blind, self-driving cars, and many more. This paper we demonstrates a concise state of art image captioning and its method for caption generation using deep learning concepts. We also determine the approach for image caption generation using Convolutional Neural Network (CNN) and Generative Adversarial Network (GAN) model in deep learning framework. Using this approach system intelligent enough to create sentences for images. It uses the encoder-decoder architecture, where CNN is used for image vector generation and LSTM is used for the generation of a logical sentence using the NLP concepts. Finally, we evaluate the proposed system experimental analysis with numerous existing systems and show the effeteness of system.
机译:图像标题是AI领域的新兴主题之一。它使用计算机视觉(CV)和自然语言处理(NLP)的组合来从图像中导出功能,使用此信息来识别对象,操作,它们的关系并生成图像的描述。它是人工智能的最重要的概念,适用于盲人,自驾车的援助等领域。本文展示了使用深度学习概念的艺术图像标题的简洁状态及其对字幕生成的方法。我们还确定使用深度学习框架中的卷积神经网络(CNN)和生成的对冲网络(GAN)模型来确定图像标题的方法。使用这种方法系统智能,足以创建图像的句子。它使用编码器解码器架构,其中CNN用于图像向量生成,并且LSTM用于使用NLP概念生成逻辑句子。最后,我们评估了具有许多现有系统的提出的系统实验分析,并显示了系统的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号