机译:在混合深度学习框架中为视频分类建模多峰线索
School of Computer Science, Fudan University, Shanghai, China;
School of Computer Science, Fudan University, Shanghai, China;
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China;
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China;
School of Computer Science, Fudan University, Shanghai, China;
Department of Electrical Engineering, Columbia University, New York City, NY, USA;
Semantics; Feature extraction; Hidden Markov models; Machine learning; Optical imaging; Context modeling; Three-dimensional displays;
机译:基于深度学习的音乐视频情感分类的多峰信息深融合
机译:用于视频分类的多模式深度表示学习
机译:视频分类的多模式深度代表学习
机译:结合深度学习模型和迁移学习的体育视频分类
机译:使用具有音频,视频和生物医学传感器的深度学习模型,对说话人和情感识别进行多模式传感和数据处理
机译:基于深度学习的音乐视频的多模式情感分类
机译:在视频混合深度学习框架中建模多模态线索 分类