International Workshop on Content-Based Multimedia Indexing

Online multimodal matrix factorization for human action video indexing

Abstract

This paper addresses the problem of searching for videos containing instances of specific human actions. The proposed strategy builds a multimodal latent space representation where both visual content and annotations are simultaneously mapped. The hypothesis behind the method is that such a latent space yields better results when built from multiple data modalities. The semantic embedding is learned using matrix factorization through stochastic gradient descent, which makes it suitable to deal with large-scale collections. The method is evaluated on a large-scale human action video dataset with three modalities corresponding to action labels, action attributes and visual features. The evaluation is based on a query-by-example strategy, where a sample video is used as input to the system. A retrieved video is considered relevant if it contains an instance of the same human action present in the query. Experimental results show that the learned multimodal latent semantic representation produces improved performance when compared with an exclusively visual representation.
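A minimal sketch of the kind of multimodal factorization the abstract describes, under simplifying assumptions: each modality matrix is modeled as X_m ≈ U V_mᵀ, with a shared per-video latent matrix U and a modality-specific projection V_m, trained by stochastic gradient descent. The toy data, dimensions, and hyperparameters below are illustrative, not the paper's actual setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy collection: 50 videos with three modalities
# (visual features, action labels, action attributes).
n_videos = 50
modalities = {
    "visual": rng.normal(size=(n_videos, 20)),
    "labels": rng.integers(0, 2, size=(n_videos, 10)).astype(float),
    "attributes": rng.integers(0, 2, size=(n_videos, 15)).astype(float),
}

k = 8      # latent dimension
lr = 0.01  # SGD step size
lam = 0.1  # L2 regularization weight

# Shared latent video representation U (n_videos x k) and one
# projection V[m] (d_m x k) per modality.
U = rng.normal(scale=0.1, size=(n_videos, k))
V = {m: rng.normal(scale=0.1, size=(X.shape[1], k))
     for m, X in modalities.items()}

for epoch in range(200):
    # Stochastic updates: visit one video at a time in random order.
    for i in rng.permutation(n_videos):
        for m, X in modalities.items():
            err = X[i] - U[i] @ V[m].T               # reconstruction error
            grad_U = -err @ V[m] + lam * U[i]
            grad_V = -np.outer(err, U[i]) + lam * V[m]
            U[i] -= lr * grad_U
            V[m] -= lr * grad_V

# Query-by-example: rank videos by cosine similarity to the query's
# latent vector in the shared semantic space.
q = U[0]
scores = U @ q / (np.linalg.norm(U, axis=1) * np.linalg.norm(q) + 1e-9)
ranking = np.argsort(-scores)
assert ranking[0] == 0  # the query video ranks itself first
```

Because U is shared across modalities, annotations (labels, attributes) shape the latent space even though a query at retrieval time only needs a video's latent vector; per-video SGD updates keep the method tractable on large collections.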

