首页> 外文会议>International Conference on Artificial Neural Networks >Multi-modal Associative Storage and Retrieval Using Hopfield Auto-associative Memory Network
【24h】

Multi-modal Associative Storage and Retrieval Using Hopfield Auto-associative Memory Network

机译:使用Hopfield自动关联内存网络的多模态关联存储和检索

获取原文

摘要

Recently we presented text storage and retrieval in an auto-associative memory framework using the Hopfield neural-network. This realized the ideal functionality of Hopfield network as a content-addressable information retrieval system. In this paper, we extend this result to multi-modal patterns, namely, images with text captions and show that the Hopfield network indeed can store and retrieve such multimodal patterns even in an auto-associative setting. Within this framework, we examine two central issues such as (i) performance characterization to show that the O(N) capacity of the Hopfield network for a network of size N neurons under the Pseudo-inverse learning rule is still retained in the multi-modal case, and (ii) the retrieval dynamics of the multi-modal pattern (i.e., image and caption together) under various types of queries such as image+caption, image only and caption only, in line with a typical multi-modal retrieval system where the entire multi-modal pattern is expected to be retrieved even with a partial query pattern from any of the modalities. We present results related to these two issues on a large database of 7000+ captioned-images and establish the practical scalability of both the storage capacity and the retrieval robustness of the Hopfield network for content-addressable retrieval of multi-modal patterns. We point to the potential of this work to extend to a more wider definition of multi-modality as in multi-media content, with various modalities such as video (image sequence) synchronized with sub-title text, speech, music and non-speech.
机译:最近,我们使用Hopfield神经网络在自动关联内存框架中提出了文本存储和检索。这实现了Hopfield网络作为内容可寻址信息检索系统的理想功能。在本文中,我们将此结果扩展到多模态模式,即具有文本标题的图像,并且表明即使在自动关联设置中也可以存储和检索这种多模码模式。在此框架内,我们研究了两个核心问题,例如(i)性能表征,以表明伪逆学习规则下尺寸N神经元网络的O(n)容量仍然保留在多个 - 模态案例,(ii)在诸如图像+标题,图像的各种类型的查询下的多模模式(即,图像和标题)的检索动态,诸如仅典型的多模态检索即使使用来自任何方式的部分查询模式,也期望整个多模态模式的系统。我们在7000+标题图像的大型数据库上显示了与这两个问题相关的结果,并建立了存储容量的实际可扩展性和Hopfield网络的用于内容可寻址的多模模式检索的霍尔菲尔德网络的实际可扩展性。我们指出了这项工作的潜力,可以扩展到多媒体内容中的多种模式的更广泛定义,具有与子标题文本,语音,音乐和非语音同步的各种模态(图像序列) 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号