首页> 外文期刊>ACM Computing Surveys >A Comprehensive Survey of Deep Learning for Image Captioning
【24h】

A Comprehensive Survey of Deep Learning for Image Captioning

机译:深度学习的图像字幕综合调查

获取原文
获取原文并翻译 | 示例
       

摘要

Generating a description of an image is called image captioning. Image captioning requires recognizing the important objects, their attributes, and their relationships in an image. It also needs to generate syntactically and semantically correct sentences. Deep-learning-based techniques are capable of handling the complexities and challenges of image captioning. In this survey article, we aim to present a comprehensive review of existing deep-learning-based image captioning techniques. We discuss the foundation of the techniques to analyze their performances, strengths, and limitations. We also discuss the datasets and the evaluation metrics popularly used in deep-learning-based automatic image captioning.
机译:生成图像描述称为图像字幕。图像字幕需要识别图像中的重要对象,它们的属性以及它们之间的关系。它还需要生成句法和语义上正确的句子。基于深度学习的技术能够处理图像字幕的复杂性和挑战。在这篇调查文章中,我们旨在对现有的基于深度学习的图像字幕技术进行全面回顾。我们讨论了这些技术的基础,以分析其性能,优势和局限性。我们还将讨论在基于深度学习的自动图像字幕中普遍使用的数据集和评估指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号