首页> 外文期刊>Data Science and Engineering >Scalable Multi-grained Cross-modal Similarity Query with Interpretability
【24h】

Scalable Multi-grained Cross-modal Similarity Query with Interpretability

机译:具有可解释性的可扩展多粒度跨模型相似查询

获取原文
获取外文期刊封面目录资料

摘要

Cross-modal similarity query has become a highlighted research topic for managing multimodal datasets such as images and texts. Existing researches generally focus on query accuracy by designing complex deep neural network models and hardly consider query efficiency and interpretability simultaneously, which are vital properties of cross-modal semantic query processing system on large-scale datasets. In this work, we investigate multi-grained common semantic embedding representations of images and texts and integrate interpretable query index into the deep neural network by developing a novel Multi-grained Cross-modal Query with Interpretability (MCQI) framework. The main contributions are as follows: (1) By integrating coarse-grained and fine-grained semantic learning models, a multi-grained cross-modal query processing architecture is proposed to ensure the adaptability and generality of query processing. (2) In order to capture the latent semantic relation between images and texts, the framework combines LSTM and attention mode, which enhances query accuracy for the cross-modal query and constructs the foundation for interpretable query processing. (3) Index structure and corresponding nearest neighbor query algorithm are proposed to boost the efficiency of interpretable queries. (4) A distributed query algorithm is proposed to improve the scalability of our framework. Comparing with state-of-the-art methods on widely used cross-modal datasets, the experimental results show the effectiveness of our MCQI approach.
机译:跨模态相似性查询已成为管理诸如图像和文本等多模式数据集的突出显示的研究主题。通过设计复杂的深度神经网络模型并同时考虑查询效率和解释性,现有研究通常专注于查询精度,这是大规模数据集中的跨模型语义查询处理系统的重要属性。在这项工作中,我们通过开发具有可解释性(MCQI)框架的新型多粒度跨模型查询将可解释的查询索引集成到深神经网络中的多颗粒常见的语义嵌入表示。主要贡献如下:(1)通过集成粗粒细粒和细粒度的语义学习模型,提出了一种多粒跨模型查询处理架构,以确保查询处理的适应性和一般性。 (2)为了捕获图像和文本之间的潜在语义关系,框架结合了LSTM和注意模式,这增强了跨模型查询的查询精度,并构建了解释查询处理的基础。 (3)索引结构和相应的最近邻查询算法提升了解释查询的效率。 (4)提出了一种分布式查询算法,以提高我们框架的可扩展性。与广泛使用的跨模型数据集的最先进方法相比,实验结果表明了MCQI方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号