Multi-label video classification via coupling attentional multiple instance learning with label relation graph

Li Xuewei; Wu Hongjun; Li Mengzhu; Liu Hongzhe

Abstract

Multi-label video classification is a challenging problem in the pattern recognition field, because it is difficult to localize where each of a huge number of labels occurs in a video. To solve this problem, we propose a general framework named MALL-CNN, i.e., Multi-Attention Label Relation Learning Convolutional Neural Network. MALL-CNN not only builds the correspondences between labels and videos with an attention mechanism, but also captures label co-occurrence with a graph learning approach. Specifically, we introduce multiple instance learning to aggregate a set of frame-level features into a video-level feature. Then, the video-level feature is mapped into content-aware category representations in an improved attentional manner. Further, these representations are enhanced by a series of label relation graphs, which transform global label relationships into the label relationships of the current video. With these three processes, frame feature aggregation, video feature mapping, and label relationship construction are all achieved in MALL-CNN for multi-label video classification. Extensive experiments on the real-world scene benchmark YouTube-8M verify that MALL-CNN with only frame features surpasses state-of-the-art methods that use multi-modal features as well as ensemble models. (c) 2022 Elsevier B.V. All rights reserved.
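
The three stages named in the abstract (frame feature aggregation, video feature mapping, label relationship construction) can be illustrated with a toy sketch. This is not the authors' MALL-CNN: the abstract gives no formulas, so every concrete choice below is an assumption made for illustration, including softmax attention pooling over frames, element-wise sigmoid gating per label, and mixing over a row-normalised co-occurrence matrix. All function names, dimensions, and the random data are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def attention_pool(frames, query):
    """Stage 1 (assumed form): treat the frames as a bag of instances and
    pool them into one video-level vector with softmax attention weights."""
    weights = softmax([dot(frame, query) for frame in frames])
    dim = len(frames[0])
    video = [sum(w * f[d] for w, f in zip(weights, frames)) for d in range(dim)]
    return video, weights


def label_representations(video, label_embeddings):
    """Stage 2 (assumed form): one content-aware vector per label, obtained
    by gating the video feature element-wise with that label's embedding."""
    return [[sigmoid(e) * v for e, v in zip(emb, video)]
            for emb in label_embeddings]


def propagate(reps, adjacency):
    """Stage 3 (assumed form): mix label vectors over a row-normalised
    label relation matrix, so related labels share evidence."""
    dim = len(reps[0])
    mixed = []
    for row in adjacency:
        total = sum(row)
        norm = [a / total for a in row]
        mixed.append([sum(w * r[d] for w, r in zip(norm, reps))
                      for d in range(dim)])
    return mixed


def classify(frames, query, label_embeddings, adjacency, classifiers):
    """Chain the three stages and emit one independent probability per label."""
    video, _ = attention_pool(frames, query)
    reps = propagate(label_representations(video, label_embeddings), adjacency)
    return [sigmoid(dot(rep, clf)) for rep, clf in zip(reps, classifiers)]


def random_vector(dim):
    return [random.uniform(-1.0, 1.0) for _ in range(dim)]


# Hypothetical toy sizes and random parameters, purely for demonstration.
random.seed(0)
DIM, NUM_FRAMES, NUM_LABELS = 4, 6, 3
frames = [random_vector(DIM) for _ in range(NUM_FRAMES)]
query = random_vector(DIM)
label_embeddings = [random_vector(DIM) for _ in range(NUM_LABELS)]
classifiers = [random_vector(DIM) for _ in range(NUM_LABELS)]
# Symmetric co-occurrence strengths between the three labels (made up).
adjacency = [[1.0, 0.5, 0.0],
             [0.5, 1.0, 0.2],
             [0.0, 0.2, 1.0]]

scores = classify(frames, query, label_embeddings, adjacency, classifiers)
print(scores)
```

Each label receives its own sigmoid score rather than sharing a softmax, which is the usual choice for multi-label outputs because several labels can be present in the same video.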

Bibliographic record

Source
Pattern Recognition Letters | 2022, Issue 4 | pp. 53-59 | 7 pages
Authors
Li Xuewei; Wu Hongjun; Li Mengzhu; Liu Hongzhe;
Author affiliation

Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China;

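Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA;
Original format: PDF
Language of text: English
Chinese Library Classification
Keywords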
Multi-label video classification; Multiple instance learning; Attentional feature learning; Label relation graph;
