【24h】

Deep Cross-Media Knowledge Transfer

机译:深度跨媒知识转移

获取原文

摘要

Cross-media retrieval is a research hotspot in multimedia area, which aims to perform retrieval across different media types such as image and text. The performance of existing methods usually relies on labeled data for model training. However, cross-media data is very labor consuming to collect and label, so how to transfer valuable knowledge in existing data to new data is a key problem towards application. For achieving the goal, this paper proposes deep cross-media knowledge transfer (DCKT) approach, which transfers knowledge from a large-scale cross-media dataset to promote the model training on another small-scale cross-media dataset. The main contributions of DCKT are: (1) Two-level transfer architecture is proposed to jointly minimize the media-level and correlation-level domain discrepancies, which allows two important and complementary aspects of knowledge to be transferred: intramedia semantic and inter-media correlation knowledge. It can enrich the training information and boost the retrieval accuracy. (2) Progressive transfer mechanism is proposed to iteratively select training samples with ascending transfer difficulties, via the metric of cross-media domain consistency with adaptive feedback. It can drive the transfer process to gradually reduce vast cross-media domain discrepancy, so as to enhance the robustness of model training. For verifying the effectiveness of DCKT, we take the large-scale dataset XMediaNet as source domain, and 3 widely-used datasets as target domain for cross-media retrieval. Experimental results show that DCKT achieves promising improvement on retrieval accuracy.
机译:跨媒体检索是多媒体领域的一个研究热点,其目的是在不同的媒体类型,如图片和文字来进行检索。现有方法的性能通常依赖于对模型训练的标签数据。然而,跨媒体数据非常费力收集和标签,因此如何以新的数据是对应用的关键问题转移宝贵的知识在现有的数据。为了实现这一目标,本文提出了深跨媒体知识转移(DCKT)方法,该方法的知识转移,从一个大规模的跨媒体数据集,以促进在另一个小规模的跨媒体数据集模型训练。 DCKT的主要贡献是:(1)两级传递体系结构提出了共同最小化媒体级和相关性的电平域的差异,这允许知识的两个重要和互补的方面被转移:媒体内的语义和介质间相关知识。它可以丰富培训信息和提高检索精度。 (2)逐行传送机构,提出了与升序传送困难迭代地选择训练样本,通过所述度量与自适应反馈跨媒体域一致性。它可以驱动移动过程中逐渐减少庞大的跨媒体领域的差异,从而提高模型训练的鲁棒性。为了验证DCKT的有效性,我们采取大规模数据集XMediaNet作为源域,和3被广泛使用的数据集作为跨媒体检索目标域。实验结果表明,DCKT达到承诺的检索精度的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号