首页> 外文会议>International Conference for Emerging Technology >Natural Language Description for Videos Using NetVLAD and Attentional LSTM
【24h】

Natural Language Description for Videos Using NetVLAD and Attentional LSTM

机译:使用NetVLAD和Attentional LSTM的视频的自然语言描述

获取原文

摘要

Video captioning infers the process of generating textual description from videos which describes the objects and actions present in it. The multimodal information is available on each frame based on texture and time in the video. In video captioning, the tremendous task is generating the caption automatically related to video content precisely. By using advancement in the field of deep learning technology, a model was developed to generates natural-language descriptions for activities in the video is proposed. In our proposed work, the first stage is extracting the key features for machine understandable about the video content using 2D and 3D CNN. The convolutional neural network(CNN) of 2D and 3D is used to extract both the spatial and temporal features respectively for transferring the videos into key features. The extracted features are preprocessed using NetVLAD. After NetVLAD preprocessing, the features are concatenated and given as input into attention based Long-Short Term Memory(aLSTM). aLSTM generates sentences in a sequential manner by selecting the salient features. The expected output of the model is a sentence to describe the contents of the video. The evaluation is done by using Bilingual Evaluation Understudy (BLEU) metrics.
机译:视频字幕说明了从视频生成文本描述的过程,该过程描述了视频中存在的对象和动作。基于视频中的纹理和时间,多帧信息可用于每个帧。在视频字幕中,一项艰巨的任务是精确地自动生成与视频内容相关的字幕。通过利用深度学习技术领域的进步,提出了一种用于生成视频中活动的自然语言描述的模型。在我们提出的工作中,第一步是使用2D和3D CNN提取机器可理解的有关视频内容的关键功能。使用2D和3D的卷积神经网络(CNN)分别提取空间和时间特征,以将视频转换为关键特征。提取的特征使用NetVLAD进行预处理。经过NetVLAD预处理后,将这些功能串联起来,并作为输入输入到基于注意力的长期记忆(aLSTM)中。 aLSTM通过选择突出特征以顺序方式生成句子。模型的预期输出是描述视频内容的句子。通过使用双语评估咨询(BLEU)指标来完成评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号