BEYOND CAPTION TO NARRATIVE: VIDEO CAPTIONING WITH MULTIPLE SENTENCES

机译：超越标题叙述：具有多个句子的视频标题

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent advances in image captioning task have led to increasing interests in video captioning task. However, most works on video captioning are focused on generating single input of aggregated features, which hardly deviates from image captioning process and does not fully take advantage of dynamic contents present in videos. We attempt to generate video captions that convey richer contents by temporally segmenting the video with action localization, generating multiple captions from multiple frames, and connecting them with natural language processing techniques, in order to generate a story-like caption. We show that our proposed method can generate captions that are richer in contents and can compete with state-of-the-art method without explicitly using video-level features as input.

机译：图像标题任务的最新进展导致了越来越多的视频字幕任务的利益。然而，大多数关于视频字幕的作品都集中在生成聚合特征的单个输入，这几乎不会偏离图像标题过程，并且没有充分利用视频中存在的动态内容。我们试图通过用动作定位临时分割视频来生成传送更丰富的内容的视频字幕，从多个帧生成多个字幕，并用自然语言处理技术将它们连接，以生成类似的故事标题。我们表明我们所提出的方法可以生成内容中更丰富的标题，并且可以与最先进的方法竞争，而不明确使用视频级别功能作为输入。

著录项

来源
《IEEE International Conference on Image Processing》|2016年|3214-3856p|共5页
会议地点
作者
Andrew Shin; Katsunori Ohnishi; Tatsuya Harada;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Video caption; Action localization; Natural language processing;

机译：视频标题;行动本地化;自然语言处理;

相似文献

外文文献
中文文献
专利

1. IP Captioning Recon Order Doesn't Require Captions for Online-Only Videos [J] . Monty Tayloe Telecom A.M. . 2013,第116期

机译：IP字幕侦听命令不需要仅在线视频的字幕
2. Multi-Sentence Video Captioning using Content-oriented Beam Searching and Multi-stage Refining Algorithm [J] . Masoomeh Nabati, Alireza Behrad Information Processing & Management . 2020,第6期

机译：使用面向内容的波束搜索和多级炼制算法的多句子视频字幕
3. Automatic sentence partitioning of TV news sentences for closed caption service to hearing impaired people [J] . Terumasa Ehara, Takahiro Fukushima, Yuji Weda, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2000,第200期

机译：电视新闻句子的自动句子划分，为听障人士提供隐藏式字幕服务
4. Beyond caption to narrative: Video captioning with multiple sentences [C] . Andrew Shin, Katsunori Ohnishi, Tatsuya Harada IEEE International Conference on Image Processing . 2016

机译：字幕之外的叙事：多字幕视频字幕
5. The effect of the use of videos captioning on English as a foreign language (EFL) on college students' language learning in Taiwan (China). [D] . Hwang, Yan-Ling. 2003

机译：在台湾（中国）使用视频字幕作为外语英语（EFL）对大学生语言学习的影响。
6. Eye movements while viewing narrated captioned and silent videos [O] . Nicholas M. Ross, Eileen Kowler -1

机译：观看旁白字幕和无声视频时的眼球运动
7. Beyond Caption To Narrative: Video Captioning With Multiple Sentences [O] . Shin, Andrew, Ohnishi, Katsunori, Harada, Tatsuya 2016

机译：超越叙事的标题：多重句子的视频字幕

BEYOND CAPTION TO NARRATIVE: VIDEO CAPTIONING WITH MULTIPLE SENTENCES

摘要

著录项

相似文献

相关主题

期刊订阅