A Comprehensive Survey of Deep Learning for Image Captioning

Hossain Md Zakir; Sohel Ferdous; Shiratuddin Mohd Fairuz; Laga Hamid

首页> 外文期刊>ACM Computing Surveys >A Comprehensive Survey of Deep Learning for Image Captioning

【24h】

A Comprehensive Survey of Deep Learning for Image Captioning

机译：深度学习的图像字幕综合调查

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Generating a description of an image is called image captioning. Image captioning requires recognizing the important objects, their attributes, and their relationships in an image. It also needs to generate syntactically and semantically correct sentences. Deep-learning-based techniques are capable of handling the complexities and challenges of image captioning. In this survey article, we aim to present a comprehensive review of existing deep-learning-based image captioning techniques. We discuss the foundation of the techniques to analyze their performances, strengths, and limitations. We also discuss the datasets and the evaluation metrics popularly used in deep-learning-based automatic image captioning.

机译：生成图像描述称为图像字幕。图像字幕需要识别图像中的重要对象，它们的属性以及它们之间的关系。它还需要生成句法和语义上正确的句子。基于深度学习的技术能够处理图像字幕的复杂性和挑战。在这篇调查文章中，我们旨在对现有的基于深度学习的图像字幕技术进行全面回顾。我们讨论了这些技术的基础，以分析其性能，优势和局限性。我们还将讨论在基于深度学习的自动图像字幕中普遍使用的数据集和评估指标。

著录项

来源
《ACM Computing Surveys》 |2019年第6期|118.1-118.36|共36页
作者
Hossain Md Zakir; Sohel Ferdous; Shiratuddin Mohd Fairuz; Laga Hamid;
展开▼
作者单位

Murdoch Univ Sch Engn & Informat Technol Perth WA 6150 Australia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Image captioning; deep learning; computer vision; natural language processing; CNN; LSTM;

机译：图片字幕;深度学习计算机视觉;自然语言处理;CNN;LSTM;
入库时间 2022-08-18 05:06:28

相似文献

外文文献
中文文献
专利

1. Features to Text: A Comprehensive Survey of Deep Learning on Semantic Segmentation and Image Captioning [J] . Ariyo Oluwasammi, Muhammad Umar Aftab, Zhiguang Qin, Complexity . 2021,第a期

机译：文字的功能：对语义分割和图像标题深度学习的全面调查
2. Colorizing and Captioning Images Using Deep Learning Models and Deploying Them Via loT Deployment Tools [J] . Krishnamurthi Rajalakshmi, Maheshwari Raghav, Gulati Rishabh International journal of information retrieval research . 2020,第4期

机译：使用深度学习模型并通过批次部署工具部署它们的彩色和标题图像
3. Survey of deep learning and architectures for visual captioning-transitioning between media and natural languages [J] . Sur Chiranjib Multimedia Tools and Applications . 2019,第22期

机译：在媒体和自然语言之间进行视觉字幕转换的深度学习和架构调查
4. A survey on Arabic Image Captioning Systems Using Deep Learning Models [C] . Anfal Attai, Ashraf Elnagar International Conference on Innovations in Information Technology . 2020

机译：使用深层学习模型的阿拉伯图像标题系统调查
5. Generation of Humorous Caption for Cartoon Images Using Deep Learning [D] . Shanmuga Sundaram, Rajesh. 2018

机译：使用深度学习的卡通形象的幽默标题
6. Lung Nodule Detection from Feature Engineering to Deep Learning in Thoracic CT Images: a Comprehensive Review [O] . Amitava Halder, Debangshu Dey, Anup K. Sadhu 2020

机译：肺结结从特征工程到深入学习中的胸部CT图像：全面审查
7. Ensemble Learning on Deep Neural Networks for Image Caption Generation [O] . Harshitha Katpally, Ajay Bansal 2020

机译：用于图像标题生成的深度神经网络的集合学习

A Comprehensive Survey of Deep Learning for Image Captioning

摘要

著录项

相似文献

相关主题

期刊订阅