Journal: Circuits, Systems, and Signal Processing
An Integrated Hybrid CNN-RNN Model for Visual Description and Generation of Captions


Abstract

Video captioning is currently considered one of the simplest ways to index and search data efficiently. Deep learning architectures now make it feasible to caption video images automatically. Past research has focused on image captioning; however, generating high-quality captions with appropriate semantics across different scenes has not yet been achieved. This work therefore aims to generate well-defined, meaningful captions for images and videos by combining convolutional neural networks (CNNs) with recurrent neural networks. Starting from the available dataset, image and video features were extracted with a CNN. The extracted feature vectors were then used to build a language model based on long short-term memory (LSTM), predicting the caption word by word. A softmax function over the vocabulary produces each word, and performance was computed with predefined evaluation metrics. The experimental results demonstrate that the proposed model outperforms existing benchmark models.
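The pipeline described in the abstract — CNN feature extraction, an LSTM language model, and a softmax over the vocabulary — can be sketched as follows. This is a minimal illustrative sketch in numpy, not the authors' implementation: the random CNN feature vector, the weight shapes, the greedy decoding loop, and all dimensions (`VOCAB`, `EMBED`, `HIDDEN`, `FEAT`) are assumptions chosen for clarity.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over vocabulary logits
    e = np.exp(logits - logits.max())
    return e / e.sum()

class LSTMCell:
    """Minimal LSTM cell with randomly initialized weights (illustration only)."""
    def __init__(self, input_dim, hidden_dim, rng):
        self.H = hidden_dim
        # Combined weights for input, forget, cell, and output gates
        self.W = rng.standard_normal((4 * hidden_dim, input_dim + hidden_dim)) * 0.1
        self.b = np.zeros(4 * hidden_dim)

    def step(self, x, h, c):
        z = self.W @ np.concatenate([x, h]) + self.b
        H = self.H
        sig = lambda v: 1.0 / (1.0 + np.exp(-v))
        i, f, o = sig(z[:H]), sig(z[H:2 * H]), sig(z[3 * H:])
        g = np.tanh(z[2 * H:3 * H])
        c = f * c + i * g          # new cell state
        h = o * np.tanh(c)         # new hidden state
        return h, c

def generate_caption(cnn_feature, cell, W_embed, W_out, start_id, end_id, max_len=10):
    """Greedy decoding: the CNN feature seeds the state; LSTM + softmax emit word ids."""
    H = cell.H
    h = np.tanh(cnn_feature[:H])   # fold the visual feature into the initial hidden state
    c = np.zeros(H)
    word, caption = start_id, []
    for _ in range(max_len):
        h, c = cell.step(W_embed[word], h, c)
        probs = softmax(W_out @ h)     # distribution over the vocabulary
        word = int(probs.argmax())     # greedy choice of the next word
        if word == end_id:
            break
        caption.append(word)
    return caption

rng = np.random.default_rng(0)
VOCAB, EMBED, HIDDEN, FEAT = 50, 16, 32, 64
cell = LSTMCell(EMBED, HIDDEN, rng)
W_embed = rng.standard_normal((VOCAB, EMBED)) * 0.1   # word embeddings
W_out = rng.standard_normal((VOCAB, HIDDEN)) * 0.1    # hidden-to-vocab projection
feature = rng.standard_normal(FEAT)                   # stand-in for a CNN feature vector
caption_ids = generate_caption(feature, cell, W_embed, W_out, start_id=0, end_id=1)
print(caption_ids)
```

In a real system the CNN would be a pretrained network, the weights would be learned by training the softmax outputs against reference captions, and the word ids would be mapped back to vocabulary tokens; the sketch only shows the data flow among the three components the abstract names.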

