首页> 外文期刊>Multimedia Tools and Applications >Modelling multilevel data in multimedia: A hierarchical factor analysis approach
【24h】

Modelling multilevel data in multimedia: A hierarchical factor analysis approach

机译:在多媒体中建模多级数据:一种分层因素分析方法

获取原文
获取原文并翻译 | 示例
           

摘要

Multimedia content understanding research requires rigorous approach to deal with the complexity of the data. At the crux of this problem is the method to deal with multilevel data whose structure exists at multiple scales and across data sources. A common example is modeling tags jointly with images to improve retrieval, classification and tag recommendation. Associated contextual observation, such as metadata, is rich that can be exploited for content analysis. A major challenge is the need for a principal approach to systematically incorporate associated media with the primary data source of interest. Taking a factor modeling approach, we propose a framework that can discover low-dimensional structures for a primary data source together with other associated information. We cast this task as a subspace learning problem under the framework of Bayesian nonparametrics and thus the subspace dimensionality and the number of clusters are automatically learnt from data instead of setting these parameters a priori. Using Beta processes as the building block, we construct random measures in a hierarchical structure to generate multiple data sources and capture their shared statistical at the same time. The model parameters are inferred efficiently using a novel combination of Gibbs and slice sampling. We demonstrate the applicability of the proposed model in three applications: image retrieval, automatic tag recommendation and image classification. Experiments using two real-world datasets show that our approach outperforms various state-of-the-art related methods.
机译:多媒体内容理解研究需要严格的方法来处理数据的复杂性。这个问题的症结在于处理多层次数据的方法,这些数据的结构存在于多个尺度和跨数据源。一个常见的示例是将标签与图像一起建模以改善检索,分类和标签推荐。关联的上下文观察(例如元数据)非常丰富,可以用于内容分析。一个主要的挑战是需要一种主要的方法来将相关的媒体与感兴趣的主要数据源系统地结合在一起。采用因素建模方法,我们提出了一个框架,该框架可以发现主要数据源的低维结构以及其他相关信息。我们将此任务作为贝叶斯非参数框架下的子空间学习问题,因此可以自动从数据中学习子空间维数和簇数,而无需事先设置这些参数。使用Beta流程作为构建块,我们以分层结构构造随机度量,以生成多个数据源并同时捕获它们的共享统计信息。使用Gibbs和切片采样的新颖组合可以有效地推断模型参数。我们证明了该模型在三个应用中的适用性:图像检索,自动标签推荐和图像分类。使用两个真实数据集进行的实验表明,我们的方法优于各种最新的相关方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号