Multi-disciplinary International Workshop on Artificial Intelligence

Learning Generalized Video Memory for Automatic Video Captioning



Abstract

Recent video captioning methods have made great progress through deep learning approaches based on convolutional neural networks (CNN) and recurrent neural networks (RNN). While some techniques use memory networks for sentence decoding, little work has leveraged the memory component to learn and generalize the temporal structure in video. In this paper, we propose a new method, Generalized Video Memory (GVM), which utilizes a memory model to enhance video description generation. Based on a class of self-organizing neural networks, the GVM model is able to learn new video features incrementally. The learned generalized memory is further exploited to decode the associated sentences using an RNN. We evaluate our method on the YouTube2Text data set using BLEU and METEOR scores as a standard benchmark. Our results are shown to be competitive against other state-of-the-art methods.
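The abstract only outlines the pipeline: CNN frame features are absorbed incrementally by a self-organizing memory, and the resulting generalized memory conditions an RNN that decodes the caption. The sketch below illustrates that flow under explicit assumptions; it is not the authors' implementation. The winner-take-all prototype update standing in for the self-organizing network, the slot count, the GRU decoder, and all dimensions are illustrative choices.

```python
# Minimal sketch of the GVM-style pipeline described in the abstract.
# All architectural details (prototype update rule, sizes, GRU decoder)
# are assumptions for illustration, not the paper's actual model.
import torch
import torch.nn as nn

class GeneralizedVideoMemory(nn.Module):
    """Toy self-organizing memory: a set of prototype vectors updated
    incrementally toward incoming video features with a simple
    winner-take-all rule (stand-in for the paper's self-organizing network)."""
    def __init__(self, num_slots=64, feat_dim=512, lr=0.1):
        super().__init__()
        self.register_buffer("prototypes", torch.randn(num_slots, feat_dim))
        self.lr = lr

    @torch.no_grad()
    def update(self, feats):
        # feats: (T, feat_dim) CNN features for the frames of one video.
        for f in feats:
            dists = torch.cdist(f.unsqueeze(0), self.prototypes).squeeze(0)
            winner = torch.argmin(dists)          # best-matching memory slot
            self.prototypes[winner] += self.lr * (f - self.prototypes[winner])

    def read(self, feats):
        # Soft attention over memory slots yields a generalized video code.
        attn = torch.softmax(feats @ self.prototypes.T, dim=-1)   # (T, num_slots)
        return (attn @ self.prototypes).mean(dim=0)               # (feat_dim,)

class CaptionDecoder(nn.Module):
    """GRU decoder conditioned on the memory read-out (RNN type is assumed)."""
    def __init__(self, vocab_size, feat_dim=512, hidden=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.init_h = nn.Linear(feat_dim, hidden)
        self.gru = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab_size)

    def forward(self, video_code, tokens):
        # video_code: (B, feat_dim); tokens: (B, L) ground-truth word ids.
        h0 = torch.tanh(self.init_h(video_code)).unsqueeze(0)     # (1, B, hidden)
        out, _ = self.gru(self.embed(tokens), h0)
        return self.out(out)                      # (B, L, vocab_size) logits

# Usage with random stand-ins for CNN frame features and a caption.
memory = GeneralizedVideoMemory()
decoder = CaptionDecoder(vocab_size=10000)
frame_feats = torch.randn(20, 512)                # 20 frames of CNN features
memory.update(frame_feats)                        # incremental memory learning
code = memory.read(frame_feats).unsqueeze(0)      # (1, 512)
caption = torch.randint(0, 10000, (1, 12))        # dummy token ids
logits = decoder(code, caption)
print(logits.shape)                               # torch.Size([1, 12, 10000])
```

In this sketch the memory is updated without backpropagation, which is what allows new video features to be absorbed incrementally, while the decoder would be trained in the usual supervised fashion against reference captions.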
