首页> 外文会议>International Conference on Computational Science and Its Applications >Human-Like Storyteller: A Hierarchical Network with Gated Memory for Visual Storytelling
【24h】

Human-Like Storyteller: A Hierarchical Network with Gated Memory for Visual Storytelling

机译:类人故事讲述者:一个具有门控记忆的分层网络,用于视觉故事讲述

获取原文

摘要

Different from the visual captioning that describes an image concretely, the visual storytelling aims at generating an imaginative paragraph with a deep understanding of the given image stream. It is more challenging for the requirements of inferring contextual relationships among images. Intuitively, humans tend to tell the story around a central idea that is constantly expressed with the continuation of the storytelling. Therefore, we propose the Human-Like StoryTeller (HLST), a hierarchical neural network with a gated memory module, which imitates the storytelling process of human beings. First, we utilize the hierarchical decoder to integrate the context information effectively. Second, we introduce the memory module as the story's central idea to enhance the coherence of generated stories. And the multi-head attention mechanism with a self adjust query is employed to initialize the memory module, which distils the salient information of the visual semantic features. Finally, we equip the memory module with a gated mechanism to guide the story generation dynamically. During the generation process, the expressed information contained in memory is erased with the control of the read and write gate. The experimental results indicate that our approach significantly outperforms all state-of-the-art (SOTA) methods.
机译:与具体描述图像的视觉字幕不同,视觉故事讲述旨在生成一个对给定图像流有深刻理解的富有想象力的段落。对于推断图像之间上下文关系的要求来说,这更具挑战性。直觉上,人类倾向于围绕一个中心思想来讲述故事,这个中心思想随着故事的延续而不断表达。因此,我们提出了仿人讲故事器(HLST),这是一种带有门控记忆模块的分层神经网络,它模仿人类讲故事的过程。首先,我们利用层次解码器来有效地整合上下文信息。其次,我们引入记忆模块作为故事的中心思想,以增强生成故事的连贯性。该算法采用具有自调整查询的多头注意机制初始化记忆模块,提取视觉语义特征的显著信息。最后,我们为存储模块配备了一个门控机制,以动态引导故事生成。在生成过程中,通过读写门的控制擦除存储器中包含的表示信息。实验结果表明,我们的方法明显优于所有最先进的(SOTA)方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号