Expressing an Image Stream with a Sequence of Natural Sentences

Abstract

We propose an approach for retrieving a sequence of natural sentences for an image stream. Since general users often take a series of pictures of their special moments, it is better to take the whole image stream into consideration when producing natural language descriptions. While almost all previous studies have dealt with the relation between a single image and a single natural sentence, our work extends both the input and output dimensions to a sequence of images and a sequence of sentences. To this end, we design a multimodal architecture called the coherence recurrent convolutional network (CRCN), which consists of convolutional neural networks, bidirectional recurrent neural networks, and an entity-based local coherence model. Our approach learns directly from the vast user-generated resource of blog posts as text-image parallel training data. We demonstrate that our approach outperforms other state-of-the-art candidate methods, using both quantitative measures (e.g., BLEU and top-K recall) and user studies via Amazon Mechanical Turk.
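
The abstract names top-K recall as one of its quantitative measures without defining it. A minimal sketch of the usual retrieval definition follows, assuming each query has exactly one ground-truth item and that the system returns a ranked list of candidate IDs per query. The function name `recall_at_k` and the toy data are hypothetical illustrations, not taken from the paper, and the paper's exact evaluation protocol may differ.

```python
from typing import Sequence


def recall_at_k(
    ranked_candidates: Sequence[Sequence[int]],
    ground_truth: Sequence[int],
    k: int,
) -> float:
    """Fraction of queries whose ground-truth item is among the top-k candidates.

    ranked_candidates[i] is the ranked list of candidate IDs for query i
    (best first); ground_truth[i] is the single correct ID for query i.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if len(ranked_candidates) != len(ground_truth):
        raise ValueError("need one ranking per ground-truth item")
    if not ground_truth:
        return 0.0
    hits = sum(
        1
        for ranking, truth in zip(ranked_candidates, ground_truth)
        if truth in ranking[:k]
    )
    return hits / len(ground_truth)


# Toy data (assumed for illustration): three queries, three candidates each.
rankings = [[3, 1, 2], [0, 2, 1], [2, 0, 1]]
truth = [1, 1, 2]

for k in (1, 2, 3):
    print(f"recall@{k} = {recall_at_k(rankings, truth, k):.3f}")
```

On this toy data the score rises with K, because a larger cutoff can only add hits; this is why results are typically reported at several values of K.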

Bibliographic details

Source: Annual conference on Neural Information Processing Systems | 2015 | pp. 73-81 | 9 pages
Authors: Cesc Chunseong Park; Gunhee Kim
Original format: PDF
