【24h】

Preface

机译:前言

获取原文

摘要

In the late years Deep Learning has been a great force of change on most Computer Vision and Multimedia tasks. In video analysis problems, however, such as action recognition and detection, motion analysis and tracking, shallow architectures remain surprisingly competitive. Assuming that the recently proposed video datasets are large enough for training deep networks for video, another likely culprit for this standstill could be the capacity of the existing deep models. More specifically, the existing deep networks for video analysis might not be sophisticated enough to address the complexity of motion information. This makes sense, as videos introduce an exponential complexity as compared to static images. Unfortunately, state-of-the-art motion representation models are extensions of existing image representations rather than motion dedicated ones. Brave, new and motion-specific representations are likely to be needed for a breakthrough in video analysis.
机译:在深度学习的深度学习中,大多数计算机愿景和多媒体任务的变化一直是巨大的变化。然而,在视频分析问题中,例如动作识别和检测,运动分析和跟踪,浅架构仍然令人惊讶。假设最近提出的视频数据集足够大用于训练视频的深度网络,这可能是这种静止的罪魁祸首可能是现有深层模型的容量。更具体地,用于视频分析的现有深度网络可能不足以满足运动信息的复杂性。这是有道理的,因为视频与静态图像相比引入指数复杂性。遗憾的是,最先进的运动表示模型是现有图像表示的扩展,而不是运动专用。视频分析中的突破可能需要勇敢,新的和特定于运动特定的表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号