Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning

Guo Zhen; Zhang Zhongfei (Mark); Xing Eric P.; Faloutsos Christos

首页> 外文期刊>ACM transactions on knowledge discovery from data >Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning

【24h】

Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning

机译：基于结构最大余量学习的多媒体数据库中的多模式数据挖掘

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Mining knowledge from a multimedia database has received increasing attentions recently since huge repositories are made available by the development of the Internet. In this article, we exploit the relations among different modalities in a multimedia database and present a framework for general multimodal data mining problem where image annotation and image retrieval are considered as the special cases. Specifically, the multimodal data mining problem can be formulated as a structured prediction problem where we learn the mapping from an input to the structured and interdependent output variables. In addition, in order to reduce the demanding computation, we propose a new max margin structure learning approach called Enhanced Max Margin Learning (EMML) framework, which is much more efficient with a much faster convergence rate than the existing max margin learning methods, as verified through empirical evaluations. Furthermore, we apply EMML framework to develop an effective and efficient solution to the multimodal data mining problem that is highly scalable in the sense that the query response time is independent of the database scale. The EMML framework allows an efficient multimodal data mining query in a very large scale multimedia database, and excels many existing multimodal data mining methods in the literature that do not scale up at all. The performance comparison with a state-of-the-art multimodal data mining method is reported for the real-world image databases.

机译：由于因特网的发展提供了巨大的存储库，最近从多媒体数据库中挖掘知识已受到越来越多的关注。在本文中，我们利用多媒体数据库中不同模态之间的关系，提出了一个通用的多模态数据挖掘问题的框架，其中图像注释和图像检索被视为特例。具体来说，可以将多模式数据挖掘问题表述为结构化预测问题，在该问题中，我们将学习从输入到结构化且相互依赖的输出变量的映射。此外，为了减少计算量，我们提出了一种新的最大余量结构学习方法，称为增强最大余量学习（EMML）框架，与现有的最大余量学习方法相比，该方法效率更高，收敛速度也更快。通过经验评估得到验证。此外，在查询响应时间与数据库规模无关的意义上，我们应用EMML框架开发了一种高度可扩展的多模式数据挖掘问题的有效解决方案。 EMML框架允许在非常大型的多媒体数据库中进行有效的多模式数据挖掘查询，并且优于文献中许多根本无法扩展的现有多模式数据挖掘方法。对于现实世界的图像数据库，报告了与最新的多模式数据挖掘方法的性能比较。

著录项

来源
《ACM transactions on knowledge discovery from data》 |2016年第3期|23.1-23.30|共30页
作者
Guo Zhen; Zhang Zhongfei (Mark); Xing Eric P.; Faloutsos Christos;
展开▼
作者单位

SUNY Binghamton, Binghamton, NY 13902 USA|SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA;

SUNY Binghamton, Binghamton, NY 13902 USA|SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA;

Carnegie Mellon Univ, Pittsburgh, PA 15213 USA|Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA;

Carnegie Mellon Univ, Pittsburgh, PA 15213 USA|Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Algorithms; Experimentation; Multimodal data mining; image annotation; image retrieval; max margin;

机译：算法;实验;多峰数据挖掘;图像标注;图像检索;最大余量;

相似文献

外文文献
中文文献
专利

1. A modified K-means clustering for mining of multimedia databases based on dimensionality reduction and similarity measures [J] . Xiaoping Jiang, Chenghua Li, Jing Sun Cluster computing . 2018,第1期

机译：基于维数减少和相似度措施的多媒体数据库挖掘修改的k均值聚类
2. Generalized affinity-based association rule mining for multimedia database queries [J] . Mei-Ling Shyu, Shu-Ching Chen, R. L. Kashyap Knowledge and information systems . 2001,第3期

机译：多媒体数据库查询的基于通用相似度的关联规则挖掘
3. Generalized Affinity-Based Association Rule Mining for Multimedia Database Queries [J] . Mei-Ling Shyu, Shu-Ching Chen, R. L. Kashyap Knowledge and Information Systems . 2001,第3期

机译：多媒体数据库查询的基于相似度的关联规则挖掘
4. Enhanced Max Margin Learning on Multimodal Data Mining in a Multimedia Database [C] . Zhen Guo, Zhongfei (Mark) Zhang, Eric P. Xing, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 20070812-15; San Jose,CA(US) . 2007

机译：多媒体数据库中多模式数据挖掘的增强最大余量学习
5. Multimedia data mining and retrieval for multimedia databases using associations and correlations. [D] . Lin, Lin. 2010

机译：使用关联和相关性对多媒体数据库进行多媒体数据挖掘和检索。
6. Maximum Mean Discrepancy Based Multiple Kernel Learning for Incomplete Multimodality Neuroimaging Data [O] . Xiaofeng Zhu, Kim-Han Thung, Ehsan Adeli, -1

机译：基于最大均值差异的不完全多模态神经影像数据的多核学习
7. Enhanced max margin learning on multimodal data mining in a multimedia database [O] . Zhen Guo, Zhongfei (mark Zhang 2007

机译：多媒体数据库中多模式数据挖掘的增强最大余量学习
8. Sparse Representation of Multimodality Sensing Databases for Data Mining and Retrieval. [R] . Hero, A. O., Savarese, S. 2015

机译：用于数据挖掘和检索的多模态传感数据库的稀疏表示。

Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅