Video Representation Fusion Network For Multi-Label Movie Genre Classification

Abstract

In this paper, we introduce a Video Representation Fusion Network (VRFN) for movie genre classification. Unlike previous work, which uses frame-level features, our approach uses a video classification architecture to create video-level features from a group of frames and fuses these features temporally to learn long-term spatiotemporal information for the movie genre classification task. We use a pre-trained I3D model to generate intermediate video representations and connect it to a C3D-LSTM model for feature fusion and genre classification. The LMTD-9 dataset, which contains 4007 trailers multi-labeled with 9 movie genres, is used to train and evaluate the model. The experimental results demonstrate that learning long-term temporal dependencies by fusing video representations improves movie genre classification performance. Our best model outperforms state-of-the-art methods by 3.4% in macro-averaged AUPRC.
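The macro-averaged AUPRC named in the abstract can be illustrated with a minimal sketch. It assumes AUPRC is estimated as average precision (one common estimator; the record does not say which procedure the authors used), and the function names `average_precision` and `macro_auprc` as well as the toy data are hypothetical.

```python
def average_precision(labels, scores):
    """Average precision for one genre: the mean of precision@k over the
    ranks k at which a positive example appears (an AUPRC estimate)."""
    # Rank examples by descending score; the sort is stable on ties.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    positives = sum(labels)
    if positives == 0:
        raise ValueError("genre has no positive examples")
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / positives


def macro_auprc(label_matrix, score_matrix):
    """Unweighted mean of per-genre average precision.

    Both arguments are lists of rows: one row per trailer, one column per genre.
    """
    n_genres = len(label_matrix[0])
    per_genre = []
    for g in range(n_genres):
        labels = [row[g] for row in label_matrix]
        scores = [row[g] for row in score_matrix]
        per_genre.append(average_precision(labels, scores))
    return sum(per_genre) / n_genres


# Toy data (assumed for illustration): 4 trailers, 2 genres.
toy_labels = [[1, 0], [0, 1], [1, 1], [0, 0]]
toy_scores = [[0.9, 0.2], [0.1, 0.8], [0.4, 0.3], [0.5, 0.6]]
toy_result = macro_auprc(toy_labels, toy_scores)
```

Macro averaging weights every genre equally, so rare genres count as much as frequent ones; with 9 genres as in LMTD-9, the same function would average 9 per-genre values.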

Bibliographic details

Source
International Conference on Pattern Recognition | 2021 | pp. 9386-9391 | 6 pages
Authors
Tianyu Bi; Dmitri Jarnikov; Johan Lukkien
Original format
PDF
Keywords
Training; Fuses; Motion pictures; Spatiotemporal phenomena; Pattern recognition; Task analysis

Similar literature

1. Movie genre classification: A multi-label approach based on convolutions through time [J]. Wehrmann Jonatas, Barros Rodrigo C. Applied Soft Computing. 2017.
2. Multi-label semantic concept detection in videos using fusion of asymmetrically trained deep convolutional neural networks and foreground driven concept co-occurrence matrix [J]. Janwe Nitin J., Bhoyar Kishor K. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies. 2018, no. 8.
3. Parallel neural networks for multimodal video genre classification [J]. Maurizio Montagnuolo, Alberto Messina. Multimedia Tools and Applications. 2008, no. 2.
4. Self-Attention for Synopsis-Based Multi-Label Movie Genre Classification [C]. Jonatas Wehrmann, Mauricio A. Lopes, Rodrigo C. Barros. International Florida Artificial Intelligence Research Society Conference. 2018.
5. Deep Learning Based Multi-Label Classification for Surgical Tool Presence Detection in Laparoscopic Videos [D]. Raju, Ashwin. 2017.
6. ML-Net: multi-label classification of biomedical texts with deep neural networks [O]. Jingcheng Du, Qingyu Chen, Yifan Peng. 2019.
7. Genre Classification of Telugu and English Movie Based on the Hierarchical Attention Neural Network [O]. Kumar Govindaswamy, Shriram Ragunathan. 2021.
