首页> 外文会议>Conference on Multimedia Information Processing and Retrieval >A Multi-label Multimodal Deep Learning Framework for Imbalanced Data Classification
【24h】

A Multi-label Multimodal Deep Learning Framework for Imbalanced Data Classification

机译:用于不平衡数据分类的多标签多模式深度学习框架

获取原文

摘要

Social media and Web services have provided a notable number of multimedia content. Due to such explosion of multimedia data, the multimedia community has been facing new challenges and exciting opportunities these days. This paper presents a new multimedia framework to address some of the main challenges in this area. In particular, it presents a multi-label multimodal framework for imbalanced data classification. For this purpose, it utilizes audio, visual, and textual data modalities and automatically generates static and temporal features using spatio-temporal deep neural networks. It also manages data with non-uniform distributions using a weighted multi-label classifier. To evaluate this framework, a video dataset containing natural disasters is used for multi-label classification. The supremacy of the proposed framework compared to the existing work is revealed with extensive experiments on this dataset.
机译:社交媒体和Web服务已经提供了大量的多媒体内容。由于多媒体数据的爆炸式增长,近来多媒体社区一直面临着新的挑战和令人兴奋的机遇。本文提出了一个新的多媒体框架,以解决该领域的一些主要挑战。特别是,它提出了一种用于不平衡数据分类的多标签多模式框架。为此,它利用音频,视觉和文本数据形式,并使用时空深度神经网络自动生成静态和时间特征。它还使用加权的多标签分类器管理具有非均匀分布的数据。为了评估此框架,将包含自然灾害的视频数据集用于多标签分类。通过对该数据集进行大量实验,揭示了与现有工作相比拟议框架的至高无上性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号