Image Captioning Methods and Metrics

机译：图像标题方法和指标

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Image Captioning is one of the emerging topics of research in the field of AI. It uses a combination of Computer Vision (CV) and Natural Language Processing (NLP) to derive features from the image, use this information to identify objects, actions, their relationships, and generate a description for the image. It is most important concept in artificial intelligence applied in the fields like aid to the blind, self-driving cars, and many more. This paper we demonstrates a concise state of art image captioning and its method for caption generation using deep learning concepts. We also determine the approach for image caption generation using Convolutional Neural Network (CNN) and Generative Adversarial Network (GAN) model in deep learning framework. Using this approach system intelligent enough to create sentences for images. It uses the encoder-decoder architecture, where CNN is used for image vector generation and LSTM is used for the generation of a logical sentence using the NLP concepts. Finally, we evaluate the proposed system experimental analysis with numerous existing systems and show the effeteness of system.

机译：图像标题是AI领域的新兴主题之一。它使用计算机视觉（CV）和自然语言处理（NLP）的组合来从图像中导出功能，使用此信息来识别对象，操作，它们的关系并生成图像的描述。它是人工智能的最重要的概念，适用于盲人，自驾车的援助等领域。本文展示了使用深度学习概念的艺术图像标题的简洁状态及其对字幕生成的方法。我们还确定使用深度学习框架中的卷积神经网络（CNN）和生成的对冲网络（GAN）模型来确定图像标题的方法。使用这种方法系统智能，足以创建图像的句子。它使用编码器解码器架构，其中CNN用于图像向量生成，并且LSTM用于使用NLP概念生成逻辑句子。最后，我们评估了具有许多现有系统的提出的系统实验分析，并显示了系统的效果。

著录项

来源
《International Conference on Emerging Smart Computing and Informatics》|2021年|522-526|共5页
会议地点
作者
Omkar Sargar; Shakti Kinger;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Deep learning; Measurement; Computational modeling; Generative adversarial networks; Natural language processing; Gallium nitride; Artificial intelligence;

机译：深入学习;测量;计算建模;生成的对抗网络;自然语言处理;氮化镓;人工智能;

相似文献

外文文献
中文文献
专利

1. A neural image captioning model with caption-to-images semantic constructor [J] . Su Jinsong, Tang Jialong, Lu Ziyao, Neurocomputing . 2019,第Nova20期

机译：具有字幕到图像语义构造函数的神经图像字幕模型
2. A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data [J] . Samabia Tehsin, Asif Masood, Sumaira Kausar, International Journal of Pattern Recognition and Artificial Intelligence . 2015,第1期

机译：从图像/视频的字幕文本检测方法，以有效地索引和检索多媒体数据
3. An Overview of Image Caption Generation Methods [J] . Haoran Wang, Yue Zhang, Xiaosheng Yu Computational intelligence and neuroscience . 2020,第4期

机译：图像字幕生成方法的概述
4. A grey relational analysis based evaluation metric for image captioning and video captioning [C] . Miao Ma, Bolong Wang IEEE International Conference on Grey Systems and Intelligent Services . 2017

机译：基于灰色关联分析的图像字幕和视频字幕评估指标
5. Image Captioning: A Survey of Existing Issues on Datasets, Evaluation Metrics and Methods [D] . zhou, liwan . 2020

机译：图像字幕：对数据集的现有问题，评估度量和方法的调查
6. An Overview of Image Caption Generation Methods [O] . Haoran Wang, Yue Zhang, Xiaosheng Yu 2020

机译：图像字幕生成方法概述
7. Image Captioning with Clause-Focused Metrics in a Multi-modal Setting for Marketing [O] . Philipp Harzig, Dan Zecha, Rainer Lienhart, 2019

机译：用营销的多模态设置中聚焦度量的图像标题

Image Captioning Methods and Metrics

摘要

著录项

相似文献

相关主题

期刊订阅