2018 IEEE Conference on Multimedia Information Processing and Retrieval

Sequential Deep Learning for Disaster-Related Video Classification


Abstract

Videos serve to convey complex semantic information and ease the understanding of new knowledge. However, when mixed semantic meanings from different modalities (i.e., image, video, text) are involved, it is more difficult for a computer model to detect and classify the concepts (such as flood, storm, and animals). This paper presents a multimodal deep learning framework that improves video concept classification by leveraging recent advances in transfer learning and sequential deep learning models. Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN) models are then used to obtain the sequential semantics for both the audio and textual modalities. The proposed framework is applied to a disaster-related video dataset that includes not only disaster scenes, but also the activities that took place during the disaster event. The experimental results show the effectiveness of the proposed framework.
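The abstract does not give implementation details, but the core sequential-semantics step it describes can be pictured as an LSTM run over per-frame (or per-segment) feature vectors, with the final hidden state fed to a softmax over concept classes. The sketch below is a minimal illustration of that idea only; the dimensions, random weights, and the assumption that features come from a pre-trained extractor are all hypothetical, not taken from the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class LSTMCell:
    """Minimal LSTM cell: one step over a single feature vector."""
    def __init__(self, input_dim, hidden_dim, seed=0):
        rng = np.random.default_rng(seed)
        # Stacked weights for the input, forget, candidate, and output gates.
        self.W = rng.standard_normal((4 * hidden_dim, input_dim + hidden_dim)) * 0.1
        self.b = np.zeros(4 * hidden_dim)
        self.hidden_dim = hidden_dim

    def step(self, x, h, c):
        z = self.W @ np.concatenate([x, h]) + self.b
        H = self.hidden_dim
        i = sigmoid(z[0:H])        # input gate
        f = sigmoid(z[H:2*H])      # forget gate
        g = np.tanh(z[2*H:3*H])    # candidate cell state
        o = sigmoid(z[3*H:4*H])    # output gate
        c_new = f * c + i * g      # update cell memory
        h_new = o * np.tanh(c_new)
        return h_new, c_new

def classify_sequence(cell, frames, W_out):
    """Run the LSTM over a sequence of frame features and
    classify the video concept from the final hidden state."""
    h = np.zeros(cell.hidden_dim)
    c = np.zeros(cell.hidden_dim)
    for x in frames:
        h, c = cell.step(x, h, c)
    logits = W_out @ h
    e = np.exp(logits - logits.max())
    return e / e.sum()             # softmax over concept classes
```

In the paper's setting, one such sequence model would process each modality (audio, textual) separately; here a single toy sequence of random 8-dimensional features stands in for extracted features.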

