首页> 外文会议>IFIP TC 12 international conference on intelligent information processing >Image Semantic Description Based on Deep Learning with Multi-attention Mechanisms
【24h】

Image Semantic Description Based on Deep Learning with Multi-attention Mechanisms

机译:基于深度学习的多注意机制的图像语义描述

获取原文

摘要

In the era of big data, cross-media and multi-modal data are expanding, and data processing methods fail to meet corresponding functional requirements. Aiming at the characteristic of large expression gap of multi-model data, This paper proposes a multimodal data fusion method based on deep learning, which combines the advantages of deep learning in the field of image detection, text sequence prediction, and the multi-attention mechanism. The BLEU algorithm is used to calculate the similarity of four levels of description statements of model output and image. Training and testing were conducted in the Flickr8K data set. Comparing with the traditional single mode state image description method, the experiments show that under the BLEU index, the multi-AM model can achieve better results.
机译:在大数据时代,跨媒体和多模式数据正在扩展,数据处理方法无法满足相应的功能要求。针对多模型数据表达间隙大的特点,提出了一种基于深度学习的多峰数据融合方法,结合了深度学习在图像检测,文本序列预测和多注意领域的优势。机制。 BLEU算法用于计算模型输出和图像的四个描述语句级别的相似度。培训和测试在Flickr8K数据集中进行。与传统的单模态图像描述方法相比,实验表明,在BLEU指标下,多AM模型可以取得较好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号