首页> 外文期刊>ACM transactions on knowledge discovery from data >Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning
【24h】

Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning

机译:基于结构最大余量学习的多媒体数据库中的多模式数据挖掘

获取原文
获取原文并翻译 | 示例
           

摘要

Mining knowledge from a multimedia database has received increasing attentions recently since huge repositories are made available by the development of the Internet. In this article, we exploit the relations among different modalities in a multimedia database and present a framework for general multimodal data mining problem where image annotation and image retrieval are considered as the special cases. Specifically, the multimodal data mining problem can be formulated as a structured prediction problem where we learn the mapping from an input to the structured and interdependent output variables. In addition, in order to reduce the demanding computation, we propose a new max margin structure learning approach called Enhanced Max Margin Learning (EMML) framework, which is much more efficient with a much faster convergence rate than the existing max margin learning methods, as verified through empirical evaluations. Furthermore, we apply EMML framework to develop an effective and efficient solution to the multimodal data mining problem that is highly scalable in the sense that the query response time is independent of the database scale. The EMML framework allows an efficient multimodal data mining query in a very large scale multimedia database, and excels many existing multimodal data mining methods in the literature that do not scale up at all. The performance comparison with a state-of-the-art multimodal data mining method is reported for the real-world image databases.
机译:由于因特网的发展提供了巨大的存储库,最近从多媒体数据库中挖掘知识已受到越来越多的关注。在本文中,我们利用多媒体数据库中不同模态之间的关系,提出了一个通用的多模态数据挖掘问题的框架,其中图像注释和图像检索被视为特例。具体来说,可以将多模式数据挖掘问题表述为结构化预测问题,在该问题中,我们将学习从输入到结构化且相互依赖的输出变量的映射。此外,为了减少计算量,我们提出了一种新的最大余量结构学习方法,称为增强最大余量学习(EMML)框架,与现有的最大余量学习方法相比,该方法效率更高,收敛速度也更快。通过经验评估得到验证。此外,在查询响应时间与数据库规模无关的意义上,我们应用EMML框架开发了一种高度可扩展的多模式数据挖掘问题的有效解决方案。 EMML框架允许在非常大型的多媒体数据库中进行有效的多模式数据挖掘查询,并且优于文献中许多根本无法扩展的现有多模式数据挖掘方法。对于现实世界的图像数据库,报告了与最新的多模式数据挖掘方法的性能比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号